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A FUNCTIONAL LIMIT THEOREM FOR THE PROFILE 
OF SEARCH TREES 
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TU Wien, Uppsala University and J. W. Goethe University 

We study the profile X n ,k of random search trees including binary 
search trees and m-ary search trees. Our main result is a functional 
limit theorem of the normalized profile X n ±/'RX n .k for k = [alognj 
in a certain range of a. 

A central feature of the proof is the use of the contraction method 
to prove convergence in distribution of certain random analytic func- 
tions in a complex domain. This is based on a general theorem con- 
cerning the contraction method for random variables in an infinite- 
dimensional Hilbert space. As part of the proof, we show that the 
Zolotarev metric is complete for a Hilbert space. 

1. Introduction. Search trees are used in computer science as data struc- 
tures that hold data (also called keys) from a totally ordered set in order to 
support operations on the data such as searching and sorting. After having 
constructed the search tree for a set of keys, the complexity of operations 
performed on the data is identified by corresponding shape parameters of the 
search tree (examples are given below). Usually, one assumes a probabilistic 
model for the set of data or uses randomized procedures to build up search 
trees so that the resulting trees become random and the typical complexity 
of operations can be captured by computing expectations, variances, limit 
laws or tail bounds. In this paper, we study the profile of a general class of 
random search trees that includes many trees used in computer science such 
as the binary search tree and m-ary search trees with respect to functional 
limit laws. 

A random binary search tree is constructed for a set of keys as follows. 
One key, the so-called pivot, is chosen uniformly from the set of data and 
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Fig. 1. A random binary search tree for the data set {1,2,3,4,5}. In the first step, key 
2 is chosen as the pivot. For the right subtree of the root holding the keys {3,4,5}, key 4 
is chosen as a pivot. The profile of this tree is (1, 2, 2, 0, 0, . . .). 

inserted in the root of the tree. All other keys are compared with the pivot. 
Those which are smaller are used to build a random binary search tree as 
the left subtree of the root; those which are larger (or equal) than the pivot 
are used to build the right subtree of the root. For building these subtrees, 
the procedure is recursively applied. An example is given in Figure 1. 

For the general class of search trees, explained in Section 2 and studied 
in this paper, this construction rule is generalized so that nodes may hold 
m— 1 > 1 keys and have m subtrees and, further, the rule to choose the pivots 
may be more general, resulting in more balanced trees as a parameter t > 
is increased; see Section 2. For example, if m = 2, then the pivot is chosen as 
the median of 2t + 1 random elements. This more general search tree model 
reduces to the binary search tree for the choice (m,t) = (2,0). 

The depth of a key in the tree is its node's distance to the root of the tree. 
This quantity is a measure of the complexity involved in searching for the 
number inserted in that node. Other quantities, important in the context of 
computer science, are the internal path length of the tree, which is the sum 
of the depths of all keys, and the height of the tree, which is the maximal 
depth in the tree. 

In this paper, we study the profile of search trees, which is the infinite 
vector X n = (X n ^)k>o, where X n ^ is the number of keys that are stored in 
nodes with depth k. 

The profile of binary search trees (and related structures) has been in- 
tensively studied in the literature [4, 6, 7, 8, 10, 11, 12, 15, 17, 25]. Most 
results concern 1st and 2nd moments. However, there are also distributional 
results, particularly for binary search trees and recursive trees [4, 6, 15] that 
are of the form 
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for fixed a (contained in a suitable interval) . The advantage of binary search 
trees and recursive trees is that there is an underlying martingale structure 
which also allows functional limit theorems to be proven (see [4, 6] for binary 
search trees). Unfortunately, this martingale structure is (generally) missing 
in the kind of trees that we want to study. 

Our main result is the following, where we actually prove functional con- 
vergence of random functions on an interval More precisely, we use the 
space D{I') of right-continuous functions with left-hand limits equipped with 
the Skorohod topology; see Section 4 for the definition and note that when, 
as here, the limit is continuous, convergence in the Skorohod topology is 
equivalent to uniform convergence on every compact subinterval. 

In the formulation of Theorem 1.1, we also use the function Ai(z), de- 
fined in Section 3 as the dominant root of (3.4), and the stochastic process 
(Y(z),z £ B) (of analytic functions in a certain domain B containing the 
interval /) that is defined as the unique solution of a stochastic fixed point 
equation (3.7) which is discussed in Section 9, satisfying the further condi- 
tions that EY(z) = 1 and that for each x G /, there exists an s(x) > 1 such 
that E|y(z)| s ^- ) is finite and bounded in a neighborhood of x. 

Theorem 1.1. Let m > 2 and t > be given integers and let (^ n ,fc)fc>o 
be the profile of the corresponding random search tree with n keys. 

Set / = {/3 > 0:1< Ai(/3 2 ) < 2Ai(/3) - 1}, /' = {p\' x (/3) : /3 € 1} and let 
(3(a) > be defined by f3(a)X' 1 (P(a)) =a. We then have, in D(I'), that 

(1.1) ( /;- LQl ° g " J ,«€ A -± (T(0(a)),a € /')• 

Remark 1.1. From the definitions of / and I', it is not clear that they 
are in fact intervals. We will make this precise in Lemma 8.5. 

Remark 1.2. In exactly the same way, one can consider other similarly 
defined parameters. For example, in Section 11, we discuss the external 
profile. 

The proof of Theorem 1.1 is divided into several steps. After defining suit- 
able function spaces (Section 4), we show (Section 9) the following theorem, 
which states that if W n (z) '-=Yl,k-^n,k zk are the profile polynomials, then 
the normalized profile polynomials W n (z) /MW n (z) converge weakly to Y(z) 
for z contained in a suitable complex region B, where Y(z) is, as above, the 
solution of a stochastic fixed point equation (3.7). Note that convergence in 
TC(B) means uniform convergence on every compact subset of B. 
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Theorem 1.2. There exists a complex region B that contains the real 
interval (1/m, /3(a+)) ; where a + is defined in (1.3), and an analytic stochas- 
tic process (Y(z),z G B) satisfying (3.7) andMY(z) = 1, such that, in7i{B), 

Finally, we apply a suitable continuous functional (which is related to 
Cauchy's formula) in order to derive Theorem 1.1 from this property (Sec- 
tion 10). 

Important tools in this argument are Theorems 5.1 and 6.1, which show 
that one can use the contraction method with the Zolotarev metric £ s for 
random variables with values in a separable Hilbert space. (We do not know 
whether these theorems extend to arbitrary Banach spaces.) 

In the special case of binary search trees, Theorems 1.1 and 1.2 have been 
proven earlier, also in stronger versions [4, 6, 7]. 

Before we go into the details, we wish to comment on the interval I of 
Theorem 1.1. It is well known that the height of random search trees is 
of order logn. Thus, it is natural that there might be a restriction on the 
parameter a = fc/logn, where k denotes the depth. 

In fact, there are several critical values for a = kj logn, namely 

/ll 1 x -l 

• a = a := - — - + - — - H h 



t + 1 t + 2 (t + l)m-l 

1 1 1 

a = "max := I — + — "77 H h 



t + 2 t + 3 {t + l)m, 

• a = Q + , where a+ > ao is the solution of the equation 

(1.3) Ai(/?(a))-alog(/3(a))-l = 0. 

In order to explain these critical values, we must look at the expected 
profile EA^fc. If a = fc/logn < ao — s (for some e > 0), then 

EX n:k ~ (m - l)m k , 

whereas if a = k/logn > oq + e, then 

E(/3(a))n Al ^ 0, ))- alo «C 3 ( a ))- 1 



EX 



n.k 



v /2 7 r(a + /3(a) 2 A' 1 / (/3(Q)))logn 



for some continuous function E(z); see Lemma 8.3. This means that up to 
level k = aologn, the tree is (almost) complete. Note that the critical value 
k/logn = ao corresponds to z = f3 = 1/m and Ai(l/m) = 1, and thus that 

n Ai(/3(a ))-«olog(/3("o))-l _ n a logm _ m k 
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for some continuous function E\(z). 

The second critical value k/logn = a max corresponds to z = (3 = 1 and 
Ai(l) = 2. Here, we have 



[uniformly for k = a max logn + 0(\/logn)]. This means that most nodes are 
concentrated around that level. In fact, a max log?i is the expected depth. 

Finally, if a = k/logn < a+, then EX n ^ — ► oo and if a = k/logn > a+, 
then ~EX n ^ — > 0. This means that the range a = k/logn G (0, oq.) is exactly 
the range where the profile X n ^ is actually present. 

We also see that the interval /' of Theorem 1.1 is strictly contained in 
(ao,a + ), but we have a max G I'. This means that we definitely cover the 
most important range. However, it seems that Theorem 1.1 is not optimal. 
The condition Ai(/? 2 ) < 2Ai(/3) - 1 comes from the fact that we are using L 
techniques in order to derive Theorem 1.1 from Theorem 1.2. We conjecture 
that this is just a technical restriction and that Theorem 1.1 actually holds 
for a G (ao, oq_). 

Incidentally, r n ^ has a similar critical value a_ < «o that is the second 
positive solution of (1.3). If a < a_, then r n ^ — > and if a > a_, then 
r n,k ~^ oo- The two constants a_, a+ are related to the speed of the leftmost 
and rightmost particles in suitable discrete branching random walks (see 
[5]). Note that they can be also computed by 



= (rn - l)m k - r n>k , 



with 



^J2ir{a + f3{a) 2 \'>(P{a)))logn 





and 




where A_ and A+ are the two solutions of 

(t+l)(m-l)-l 

log(A + t + j) - log(m(tm + m - l)l/t\) 

3=0 
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(1.4) 

(t+l)(m-l)-l \_i 



U x+t+j ' 

Further, the expected height of m-ary search trees satisfies Ei/ n ~ a+ log n 
and the expected saturation level KH n ~ a_ logn. 

Notation. If / and g are two functions on the same domain, then / < g 
means the same as / = 0(g), that is, |/| < Cg for some constant C. 

2. Random search trees. To describe the construction of the search tree, 
we begin with the simplest case t = 0. If n = 0, the tree is empty. If 1 < n < 
m — 1, the tree consists of a root only, with all keys stored in the root. 
If n > m, we randomly select to — 1 keys that are called pivots (with the 
uniform distribution over all sets of m — 1 keys). The pivots are stored in 
the root. The m — 1 pivots split the set of the remaining n — to + 1 keys 
into m subsets I\, . . . ,I m : if the pivots are x\ < x 2 < • • • < x m -i, then I\ : = 
{xi :xi < xi}, h ■= {xi'.xi <Xi < x 2 }, ...,I m := {x t :x m _i < xj. We then 
recursively construct a search tree for each of the sets I{ of keys (ignoring Ii 
if it is empty) and attach the roots of these trees as children of the root in 
the search tree. 

In the case m = 2, t = 0, we thus have the well-studied binary search tree 
[4, 6, 7, 11, 12, 15, 26]. 

In the case t > 1, the only difference is that the pivots are selected in a 
different way, which affects the probability distribution of the set of pivots 
and thus of the trees. We now select nit + m — 1 keys at random, order them 
as j/i < • • • < y mt+m ~i and let the pivots be 2/*+i,y 2 (*+i)> • • • ,y(m-i)(t+i)- In 
the case m <n < mt + m — 1, when this procedure is impossible, we select 
the pivots by some supplementary rule (possibly random, but depending 
only on the order properties of the keys); our results do not depend on the 
choice of this supplementary rule. 

This splitting procedure was first introduced by Hennequin for the study 
of variants of the Quicksort algorithm and is referred to as the generalized 
Hennequin Quicksort (cf. Chern, Hwang and Tsai [9]). 

In particular, in the case m = 2, we let the pivot be the median of 2t + 1 
randomly selected keys (when n > 2t + 1). 

We describe the splitting of the keys by the random vector V n = (V n> i, V^2, 
• • • i Vn,m), where V n< k '■= \Ik\ is the number of keys in the fcth subset and thus 
the number of nodes in the /cth subtree of the root (including empty sub- 
trees). 

We thus always have, provided n > m, 

V n \ + V n 2 H h V nm = n-(m-l)=n+l-m 
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and elementary combinatorics, counting the number of possible choices of 
the rat + m — 1 selected keys, shows that the probability distribution is, for 
n > rat + m — 1 and n± + n% + • • • + n m = n — m + 1 , 

mi\ /nm\ 

(2.1) P{V n = (n l5 . . . , n m )} = K y" n K \ } . 



\mt+m—l) 



(The distribution of V n for m<n < mi + m — 1 is not specified.) 

In particular, for n > mi + m — 1, the components V^j are identically 
distributed and another simple counting argument yields, for n > mi + m — 1 
and < ^ < n - 1, 



(2.2) V{V n j = £} 



\t/ v(m-l)t+rn-2J 



Vmt+m— 1/ 

For example, for the binary search tree with m = 2 and i = 0, we thus 
have V n ,\ and V^,2 = n — 1 — V n -i uniformly distributed on {0, . . . , n — 1}. 

3. The profile polynomial. The recursive construction of the random 
search tree in Section 2 leads to a recursion for the profile X n = (A^ n fc)fc>o : 

(3.1) A n>fc - i + + • • • + A K imifc _!, 

jointly in A; > for every n > m, where the random vector V n = (V^i, 1^,2 5 

• • • i ^n,m) is as in Section 2 and is the same for every k > 0, and X^' } = 

(^nl) fc > ' = ■ • ■ ' m ' are independent copies of X n that are also indepen- 
dent of V n . We further have X n fi = m — 1 for n> m. For n < m — 1, we 
simply have X nj o = n and X n & = 0, A; > 1. 

Note that, by induction, X n % = when k>n. Hence, each vector X n has 
only a finite number of nonzero components. 

Let W n (z) =J2k-^n.kZ k denote the random profile polynomial. By (3.1), 
it is recursively given by W n (z) = n for n < m — 1 and 

(3.2) W n (z) 4 z w$ tl (z) + zW^l (*) + ••• + zW^l (z)+m-l, n>ra, 

where W\ > \z), j = 1, . . . , m, are independent copies of We(z) that are inde- 
pendent of V n , £ > 0. From this relation, we obtain a recurrence for the ex- 
pected profile polynomial E,W n (z). We have, using (2.2), for n > mt + m — 1, 

(3.3) EW^) = mzJ2 Y~n t+7 7 E W e (z) + m - 1. 

£=0 W+m- 1/ 

For any fixed complex z, this is a recursion of the type studied in Chern, 
Hwang and Tsai [9]. More precisely, it fits ([9], (13)) with a n = KW n (z), 
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r = mt + m — 1 and ct = mzr\/t\, while cj = for j ^ t. Further, b n = m — 1 
for n > mt + m — 1, while b n = a n = KW n (z) for n < mt + m — 1. 

It follows from [9] that the asymptotics of EW n (x) as n — ► oo depend on 
the roots of the indicial polynomial 

(mt + m- 1)! j 



A(0;z) :=r +m ^-mz 

(3.4) = 0(0 + l)---(0 + mt + m-2) 

_^ (mt + m-l)! g(g + 1) + 

using the notation x m := a; (a; + 1) • ■ • (x + m — 1) = r(a; + m)/T{x). If we set 

(3.5) F((9) :=— — (e + t)(0 + t+l).-.(e + mt + m-2) J 

mymt + m— lj! 

then 

A(6)^) = m(m * + m - 1)! g*(F(g)-z), 

which implies that the roots of A(A; z) = are 0, —1, —2, ...,—< + 1 (if t > 
1) together with the roots of F(#) = z. Let Aj(z), j = 1, . . . , (m — + 
1), denote the roots of F(6) = z (counted with multiplicities), arranged in 
decreasing order of their real parts: KAi(^) > "tft^iz) > ■ ■ ■ . 

Further, let D s , for real s, be the set of all complex z such that KAi(z) > s 
and ^HXi(z) > R\2(z) [in particular, Xi(z) is a simple root]. It is easily seen 
that the set D s is open and that \\{z) is an analytic function of z £ D s . If 
z £ D s is real, then Ai(z) must be real (and thus greater than s) because 
otherwise, Ai(z) would be another root with the same real part. 

By [9], Theorem l(i), we have the following result. Note that Kq and K\ 
[our E(z)] in [9], Theorem l(i), are analytic functions of z and Ai, and thus 
of z £ D%, and that they are positive for Ai > because bk = m — 1 > for 
k > mt + m — 1 and bk = EWfc(z) > for smaller k. (See also Lemma 8.2 and 
the Appendix.) 

Lemma 3.1. Ifz£Di, then 

EW n (z) = (E{z)+o(l))n Xl ^- 1 
for some analytic function E(z) with E(z) > for z £ D\ n (0, oo). 



Lemma 3.2. The set D\ is an open domain in the complex plane that 
contains the interval (1/m, oo). 
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[Lemma 3.2 will be proven in a more general context in Lemma 8.1. Note 
that F(l) = 1/m and thus Ai(l/m) = 1.] 

Set M n (z) = W n (z)/G n (z), where G n (z) = EW n (z). Then (3.2) can be 
rewritten as 

M (z) ± GVn ' l{z) zM {1) (z) + --- + Gv( l -M (m) (z) + m ~ l 
Mn[z) ~ G n (z) zM VnA z )+ + Gn{z) zM v n J z > + Gn{z y 

Note that Gv(z), where V is an integer- valued random variable, is con- 
sidered as the random variable EW n (z)\ n= y and not as Eiyy(z), that is, the 
expected value is only taken with respect to X n . Next, let the random vec- 
tor V = (Vi, V2, ■ ■ ■ , V m ) be supported on the simplex A = {(si, . . . , s m ) : sj > 
0, si + ■ — h s m = 1} with density 

J{S\, . . . , S m ) — [t\) m { s l''' s rn) 5 

where t > is the same integer parameter as above. (This is known as a 
Dirichlet distribution.) It is easy to show that 

Id 

(3.6) — V n — >V as n ^ 00. 

n 

Remark 3.1. For n > mt + m — 1, the shifted random vector (V ni i — 
t, . . . , V n ,m ~ t) has a multivariate Polya-Eggenberger distribution that can 
be defined as the distribution of the vector of the numbers of balls of different 
color drawn in the first n — (mt + m — 1) draws from an urn with balls of m 
colors, initially containing t + 1 balls of each color, where we draw balls at 
random and replace each drawn ball together with a new ball of the same 
color (see, e.g., Johnson and Kotz [20], Section 4.5.1). 

This distribution can be obtained by first taking a random vector V 
with the Dirichlet distribution above and then a multinomial variable with 
parameters n — (mt + m — 1) and V ([20], Section 4.5.1). Using this repre- 
sentation, (3.6) follows immediately from the law of large numbers, even in 
the stronger form V n /n V. 



It follows from (3.6) and Lemma 3.1 that 

G Vn,j ( Z ) d V \!(Z)-1 

G n (z) ~^ i 

if z € D\ and E(z) 7^ 0. Hence, if M n (z) has a limit (in distribution) Y(z) 
for some z £ D\ with E(z) 7^ 0, then this limit must satisfy the stochastic 
fixed point equation 

Y(z) = zV* l{z) - l YV(z) + zV} l{z) - 1 Y^)(z) + ■■■ + zV^- 1 Y^ m \z), 

(3.7) 
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where Y^\z) are independent copies of Y(z) that are independent of V. 
[Note that z € D\ and E(z) / imply that G n (z) — ► oo.] 

In Section 9, we will show that this limit relation is actually true in a 
suitable domain, even in a strong sense, as asserted in Theorem 1.2. We will 
also see that we have a unique solution of this stochastic fixed point equation 
under the assumption E7(z) = 1 and a certain integrability condition. 

4. Function spaces. For functions defined on an interval ICR, we use 
the space D(I) of right-continuous functions with left-hand limits equipped 
with the Skorohod topology. A general definition of this topology is that 
f n — > f as n — > oo if and only if there exists a sequence A n of strictly increas- 
ing continuous functions that map I onto itself such that X n (x) — ► x and 
fn(X n (x)) — > f(x), uniformly on every compact subinterval of /; see, for ex- 
ample, [2], Chapter 3, (I = [0,1]), [24], [18], Chapter VI, [21], Appendix A2 
([0, oo)), [19], Section 2. It is of technical importance that this topology 
can be induced by a complete, separable metric [2], Chapter 14, [18], Theo- 
rem VI. 1.14, [21], Theorem A2.2. Note that it matters significantly whether 
or not the endpoints are included in the interval /, but we can always re- 
duce to the case of compact intervals because /„ — > f in D(I) if and only 
if f n — ► f in D(Jk) for an increasing sequence of compact intervals Jk with 
U Jf~ = I. In particular, when / is continuous, f n —*fin D(I) if and only if 
f n —*f uniformly on every compact subinterval. Similarly, if F n and F are 
random elements of D(I) and F is a.s. continuous, then F n — F in D(I) 
if and only if F n — F in D{ J) for every compact subinterval J C I. 

For analytic functions on a domain (i.e., a nonempty open connected set) 
D C C, we will use two topological vector spaces. 

• 7~L(D) is the space of all analytic functions on D with the topology of 
uniform convergence on compact sets. This topology can be defined by 
the family of seminorms / 1— > sup^- where K ranges over the compact 
subsets of D. 7i(D) is a Frechet space, that is, a locally convex space 
with a topology that can be defined by a complete metric, and it has 
(by Montel's theorem on normal families) the property that every closed 
bounded subset is compact (see, e.g., [28], Chapter 1.45, or [29], Example 
10.11 and Theorem 14.6). It is easily seen that the topology is separable 
[e.g., by regarding TC(D) subspace of Cq°(D)]. 

• B{D) is the Bergman space of all square-integrable analytic functions on 
D, equipped with the norm given by ||/||gm) = Jd \f(z)\ 2 dm(z), where 
m is the two-dimensional Lebesgue measure. 13(D) can be regarded as a 
closed subspace of L 2 (R 2 ) and is thus a separable Hilbert space (see, e.g., 
[22], Chapter 1.4). 
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Since these spaces are metric spaces, we can use the general theory in, for 
example, Billingsley [2] or Kallenberg [21] for convergence in distribution of 
random functions in these spaces (equipped with their Borel a- fields). 

B{D) has the advantage of being a Hilbert space, which will be important 
for us later. On the other hand, 7~L{D) is, in several ways, the natural space 
for analytic functions. One important technical advantage of 7i(D) is that 
it is easy to characterize tightness. Recall that a sequence (W n ) of random 
variables in a metric space S is tight if for every e > 0, there exists a compact 
subset K C S such that P(W n € K) > 1 — e for every n. In a Polish space, 
that is, a complete separable metric space, tightness is equivalent to relative 
compactness (of the corresponding distributions) by Prohorov's theorem [2] , 
Theorems 6.1 and 6.2, [21], Theorem 16.3. [Both H(D) and B(D) are Polish, 
by the properties above.] 

Lemma 4.1. Let D be a domain in C. A sequence (W n ) of random ana- 
lytic functions on D is tight inH.(D) if and only if the sequence (sup zeK \ W n (z)\) 
is tight for every compact K C D, that is, if and only if for every compact 
K C D and every e > 0, there exists an M such that P(sup 2g ^- |W n (z)| > 
M) < e for all n. 

Proof. This is an easy consequence of the characterization of compact 
sets as closed bounded sets in TC(D). We omit the details. □ 

The embedding B(D) — > H(D) is continuous [22], Lemma 1.4.1. Thus, 
convergence in distribution in B(D) implies convergence in 7i{D). Similarly, 
if D' C D is a subdomain, then the restriction mappings TC(D) — > TC(D') 
and B(D) — > B(D') are continuous and thus convergence in distribution in 
TC(D) or B(D) implies convergence (of the restrictions) in 7i{D') or B(D'), 
respectively. 

The following theorem is a converse, which makes it possible to reduce the 
proof of convergence in 7i to local arguments. In applications, it is convenient 
to let D x be a small disc with center x. 

Theorem 4.1. Let DCC be a domain. Suppose that (W n ) is a sequence 
of random analytic functions on D and that for each x G D, there is an open 
subdomain D x with x £ D x C D and a random analytic function Z x on D x 

such that W n — —> Z x in TC(D X ) as n — > oo. There then exists a random 

analytic function Z on D such that W n — — > Z in TL{D) as n — > oo and the 

restriction Z\d x = Z x for every x. 

To prove this, we use the following general measure-theoretic lemma, 
which we copy from Bousquet-Melou and Janson [3], Lemma 7.1. 
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Lemma 4.2. Let S\ and S2 be two Polish spaces and let (f>:S\ — > S2 be 
an injective continuous map. If (W n ) is a tight sequence of random elements 

of S\ such that 4>(W n ) — —> Z in S2 for some random Z G S2, then W n — —> W 

in Si for some W with (f>(W) = Z. 

Proof of Theorem 4.1. Let, for every x G D, D' x be a small open disc 

with center x such that D' x C D x . Since W n — ► Z x in Tt(D x ), the sequence 
(W n ) is tight in 7i(D x ) for every x G D. In particular, by Lemma 4.1, the 
sequence (sup D / \W n \) is tight. 

If K C D is compact, then K can be covered by a finite number of the 
discs D' x and it follows that the sequence (sup^- \ W n \) is tight. Consequently, 
the sequence (W n ) is tight in TL{D) by Lemma 4.1. 

We now fix x G D and apply Lemma 4.2 with S\ = 7i{D), S2 = TL{D X ) 
and (j) the restriction map. Note that (j) is injective since the functions are 
analytic and D is connected. The result follows. □ 

For future use, we include the following alternative characterization of 
completeness in an arbitrary complete metric space S. If A C S, let A 6 
denote the set {x : d(x, A) < e}. 

Lemma 4.3. If{W a } is a family of random variables in a complete met- 
ric space S, then the following are equivalent characterizations of tightness 
of{W a }: 

(i) for every e > 0, there exists a compact set K C5 such that 

^(X a G' K) < e for every a; 

(ii) for every e, 8 > 0, there exists a compact set K C 5 such that 

F(X a K s ) < e for every a; 

(iii) for every e, 5 > 0, there exists a finite set F C S such that 

F(X a £F 5 )<£ for every a. 

Proof, (i): This is the standard definition of tightness [2], Chapter 6. 

(i) =^ (ii): This is obvious. 

(ii) (iii): Given e and 5, let K be as in (ii). Since K is compact, 
there exists a finite set F such that K C F s and thus K s C F 2S . Hence, 
F(X a G" F 25 ) < e. 

(iii) =^> (i) : Let F n be a finite set such that ¥>(X a (£ F}J n ) < e • 2~ n for 

1 In 

every a, and let K := Hn>i Fn ■ K is then closed and totally bounded, thus 
compact, and F(X a $ K) < En F ( x <* $ Fn") < £ for every a. □ 
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5. The Zolotarev metric on a Hilbert space. We recall the definition 
of the Zolotarev metric for probability measures in a Banach space; see 
Zolotarev [30]. 

If B and B\ are Banach spaces and / : U — > B\ is a function defined on an 
open subset U C B, then / is said to be (Frechet) differentiable at a point 
XE.UH there exists a linear operator Df(x) :B^B\ such that \\f(x + y) — 
f(x) — Df(x)y\\B 1 = o(\\y\\B) as — ► 0. Further, / is differentiable in U 
if it is differentiable for every x G U. Df is then a function U — ► L(B,B\) 
and we may talk about its derivative -D 2 / = DDf, and so on. Note that 
the mth derivative D m f (if it exists) is a function from U into the space 
of multilinear mappings B m — > B\. Let C m {B,B\) denote the space of m 
times continuously differentiable functions / : B — > B\. 

Given a Banach space B and a real number s > 0, write s = m + a with 
< a < 1 and m := \s] — 1 G N>o, and define 

^ := {/ G C m tB,M) : ||D m /(x) - D m /(y)|| < ||x - y\\ a ,x,y € S}. 

We will also write J^* := {/ € C m (B,R) : c/ G ^ for some c> 0}. 

The Zolotarev metric £s is a distance between distributions, but it is often 
convenient to talk about it as a distance between random variables, keeping 
in mind that only their distributions matter. For two random variables X 
and Y with values in B, or for their corresponding distributions C(X) and 
C(Y), the Zolotarev metric ( s is defined by 

(5.1) ( S (X,Y) := ( S (£(X),JC(Y)) := sup \E(f(X) - f(Y))\. 

Note that this distance may be infinite, but it is easily seen, by a Taylor 
expansion of /, that it is finite if E||X|| S < oo, E||Y|| S < oo, and X and Y 
have the same moments up to order m, where the kth. moment of X is KX® k , 
regarded as an element of the kth (completed) projective tensor power B® k . 

Remark 5.1. The dual space of B® k is the space of bounded multilin- 
ear mappings B k -» R. Hence, EX® k = EY® k if and only if Eg(X, ...,X) = 
Eg(Y, . . . , Y) for every bounded multilinear mapping B k — > M.. See, for ex- 
ample, [29], Chapter 45 for facts on tensor products. 

We define, for a given sequence z = (z\, . . . ,z m ) with G B® k , k = 
l,...,m, 

V s ,z(B) := {C(X) :E\\X\\ S < oo,EX® k = z k , k = l,...,m}, 

that is, the set of probability measures on B with finite absolute sth moment 
and moments Zi,...,Z)-- Thus, ( s is finite on each V S ^(B) and it is obviously 
a semi-metric there. 
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We are mainly concerned with the cases < s < 1 when m = and 1 < 
s < 2 when m = l. In these cases, we write V S {B) and V StZ (B), respectively, 
where z G H is the mean. 

For a general Banach space B, we do not know whether £ s is always a com- 
plete metric on V s ,z(B). Moreover, according to Bentkus and Rachkauskas 
[1], it is not hard to show that in a general Banach space, convergence in £ s 
does not imply weak convergence (convergence in distribution) when s > 1, 
although we do not know of any explicit counterexample. (It is easy to see 
that convergence in £ s for < s < 1 implies weak convergence, by the proof 
of Theorem 2.1(h) ==>■ (hi) in [2].) We will therefore, in the sequel, restrict 
ourselves to separable Hilbert spaces, where we can show these desirable 
properties. 

Theorem 5.1. If H is a separable Hilbert space and s > 0, then ( s is a 
complete metric on the set Vg^H) of all probability measures on H with a 
finite sth absolute moment and given kth moments z^, 1 <k < s. Moreover, 
if X n ,X are H -valued random variables with distributions in V SZ (H) and 

C s (X n ,X)^0, thenX n ^X. 

The final assertion is proved by Gine and Leon [16] . For completeness, we 
include a short proof using lemmas needed for the first part. 

Proof of Theorem 5.1. First, note that Q is a metric on V S , Z (H) 
[30]; the fact that ( s (fi, v) = implies \i = v for probability measures /i, v G 
V S}Z (H) is well known and follows easily because x i— > e^ x ' y ^ € J-* for ev- 
ery y G H, thus if ( s (fi,v) = 0, then, by (5.1), the characteristic functions 
J e i{x,y) al^x) and / e 1 ^'^' du(x) are equal, which implies that all finite-dimen- 
sional projections coincide for \x and v, and [i = v then follows by a monotone 
class argument (see, e.g., [23], Section 2.1). 

We continue by constructing some other functions in T*. Taking small 
positive multiples of them, we thus obtain functions in T s . 

Lemma 5.1. Let tp G C 00 ^) with ip(x) = x 2 for \x\ < 1/2, ip(x) = 1 for 
\x\ > 1 and 1/4 < <p(x) < 1 for 1/2 < |x| < 1. Then f(x) := tp(\\x\\) G T* for 
every s > 0. 

Proof. First, note that x i— > \\x\\ 2 is infinitely differentiable on H. (In 
fact, the third derivative vanishes.) Hence, if is any C°° function, 

then = 5((||x|| 2 ) 1//2 ) is infinitely differentiable on /f\{0}. 

Consequently, <^(||x||) is infinitely differentiable both in {a;:||a;|| < 1/2} 
and in {x: \\x\\ > 0}, and thus everywhere. Further, any derivative of order 
> 1 vanishes for ||x|| > 1 and is bounded on ||x|| < 1, hence it is globally 
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bounded. In particular, D m f is both bounded and has a bounded derivative, 
which implies that / G J-*. (Consider the cases \\x — y\\ < 1 and \\x — y\\ > 1 
separately.) □ 

Lemma 5.2. Let ijj G C°°(R) with ijj(x) > 0, ijj(x) = for \x\ < 1/2 and 
tp(x) = \x\ s for \x\ > 1. T/ien /(x) := ?/>(||x||) G JF* /or every s > 0. 

Proof. It is easily seen, as in Lemma 5.1, that / is infinitely differen- 
tiable and that D k f(x) = 0{\\x\\ s ~ k ) for every fixed k > 0. Hence, if x, y G H 
with \\x — y\\ < ||x||/2, then 

D rn+1 f(z) = 0{\\z\\ s - m ~ l ) = Odlxll"" 1 ) 

for z G [x,y] and thus 

\\D m f{x) - D m f{y)\\ = 0(\\x - y\\ \\x\r 1 ) = 0(\\x - y\\ a ). 

The same holds by symmetry if \\x — y\\ < ||y||/2. 
Finally, if \\x — y\\ > h\\x\\, h\\y\\, then 

\\D m f(x) - D m f(y)\\ < \\D m f(x)\\ + \\D m f(y)\\ 

= 0{\\x\\ s - m +\\y\\ s - m ) 

= 0(\\x-y\\ a ). 

Thus, /G^*. □ 

In the following Lemmas 5.3-5.6, we assume that {/U n }i° is a sequence 
of probability measures in Vs^H) that is a Cauchy sequence for £ s and let 
{X n }f° be H- valued random variables such that X n has the distribution \x n . 

Lemma 5.3. The random variables ||^n|| s are uniformly integrable. 

Proof. Let f{x) = tp(\\x\\) be as in Lemma 5.2 (for some fixed choice 
of ip); by Lemma 5.2, cf G T s for some constant c> 0. Let, for r > 0, f r {x) = 
cr s f(x/r). Then, as is easily seen, f r G T s for every r > 0. Further, f r [x) = 
c||x|| s when ||x|| > r, but f r {x) = when ||x|| < r/2. 

Since \ f r {X m )\ < cC\\X m \\ s for all r, where C := sup x _^ /(x)/||x|| s < oo, 
and f r (X m ) — > as r — > oo, dominated convergence yields Kf r (X m ) — > as 
r — > oo for every fixed m. 

Now, let e > and choose N such that Cs(^n 5 -X"at) < e for n > A^. Choose 
r such that Mf r (X m ) < e for m = 1, . . . , N. Then, for every n > N, 

Ef r (X n ) < Ef r (X N ) + Cs(X n ,X N ) < 2e. 

Thus, E,f r (X n ) < 2e for all n. Consequently, for all n, 

E(||X n || s l[||X n || > r]) < c^E/,.^,,) < 2c~ 1 e. 

Since e > is arbitrary, this shows the desired uniform integrability. □ 



16 



M. DRMOTA, S. JANSON AND R. NEININGER 



Lemma 5.4. The sequence (X n ) is tight. 

Proof. Let f(x) be as in Lemma 5.1, and let c > be such that cf G T s . 

Let £,r] > and assume that e,r] < 1/2. Let 6 := e 2 r]/6 < 1/2. 

Choose N such that ( s (X n ,XN) < c5 for n> N. Since H is complete 
and separable, each X n is tight [2], Theorem 1.4. Hence, there exists, by 
Lemma 4.3, for each n, a finite set F n C H such that F(X n ^ F^) < 5. Let 
F:=\J?F n . 

Let M be the subspace of H spanned by F, let P be the orthogonal 
projection onto M and let Q = I — P be the complementary projection. If 
X n G F- 5 , then ||Q(X n )|| < 5 and thus f{Q{X n )) = \\Q{X n )\\ 2 < 5 2 . Hence, 

(5.2) Ef(Q(X n )) <5 2 + F(X n $ F s ) < 25, n< N. 

Since cf G T a and Q is a linear operator with norm at most 1, it is easily 
seen that cf ' o Q G T s too. Hence, for n> N, 

\Ef(Q(X n )) - Ef(Q(X N ))\ < c-\ s {X n ,X N ) < 5 

and, by (5.2), 

Ef(Q(X n ))<Ef(Q(X N )) + 5<35, n>N. 

Combining this with (5.2), we see that Ef{Q(X n )) < 35 for all n. Hence, by 
Markov's inequality, 

(5.3) n\\Q(X n )\\ >e)< F(f(Q(X n )) > e 2 ) < 35/e 2 = r,/2. 

Next, the random variables P(X n ) lie in the finite-dimensional space M 
and sup n E||P(X n )|| s < sup n E||X n || s < oo, by Lemma 5.3. Let Kr := {x G 
M : \\x\\ < R}. Then Kr is compact and it follows from Markov's inequal- 
ity that if R is large enough, then F(P(X n ) ^ Kr) < i]/2 for every n and 
consequently, recalling (5.3), 

(5.4) F(X n i K R ) < F(P(X n ) i K R ) + P(||Q(X„)|| > e) < r). 

We have shown that for every e,rj> 0, there exists a compact set Kr such 
that (5.4) holds for all n. (Clearly, we may assume that e,i] < 1/2, as we 
have done.) By Lemma 4.3, the family {X n } is tight. □ 

Lemma 5.5. //, further, X n — — > X for some H-valued random variable 
X, then C{X) G V S , Z ( H ) and E f( X n) Ef(X) for every f G T s . 

Proof. First, by Fatou's lemma and Lemma 5.3, 

E||A1 S < liminf E||X n || s < supE||X n || s < oo. 
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Next, if / G J- s , then f(X n ) — > f{X) because / is continuous. Further, a 
Taylor expansion yields (see [30], (18)) \f(x)\ < K + for some K 

(depending on /). Hence, |/(X n )| < K + if||X n || s and {f(X n )} is uniformly 
integrable by Lemma 5.3. Consequently, E,f(X n ) -^Kf(X). 

If f(x) = g(x, . . . , x) for some continuous multilinear mapping g : B k — > IR 
with k <m, then, for every n, using the duality between tensor powers and 
multilinear mappings in Remark 5.1, 

E/(X n ) =M(g,XS k ) = (g,MX® k ) = (g,z k ). 

Further, D m f is constant and thus f £ J- s , consequently, 

(g,EX® k ) = E/(X) = limE/(X n ) = (g,z k ). 

Since g is arbitrary, this implies that KX® k = z k and thus C(X) G V s ,z(H). 
□ 

Lemma 5.6. If X n — ^ X for some H -valued random variable X , then 

Cs(x n ,x)^o. 

Proof. Let e > and choose N such that ( s (X n ,X m ) < e if n,m > N. 
For any / € T s and any n,m> N, we thus have |E/(X n ) — E/(X m )| < e. 
Letting m — > oo, we thus obtain, by Lemma 5.5, \Kf(X n ) — Ef(X)\ < e for 
n > N and every / e T s . Thus, Cs{X n , X) < e for n > N. □ 

We may now complete the proof of Theorem 5.1. First, assume that (/x n ) is 
a Cauchy sequence in V S!Z (H), equipped with metric ( s . Let X n be random 
variables with the distributions /i n . By Lemma 5.4, the sequence (X n ) is 
tight, so, by Prohorov's theorem, there exists a subsequence that converges in 
distribution to some .ff -valued random variable X. Let /i be the distribution 
of X. Considering this subsequence only, we see by Lemma 5.5 that /i G 
V s ,x{H) and by Lemma 5.6 that ^ n — > fi in V s ,x{H) along the subsequence. 
Since (iJ, n ) is a Cauchy sequence, the full sequence also converges. Hence 
V s ,z(H) is complete. 

Second, assume that X n and X are H- valued random variables with distri- 
butions in V s ,z{H) such that ( s (X n ,X) — > 0. In particular, the distributions 
C(X n ) form a Cauchy sequence in V s ,z(H), so, by Lemma 5.4, the sequence 
is tight. If a subsequence converges in distribution to some random variable 
Y, then Lemma 5.6 shows that (, s (X n ,Y) — ► along the subsequence and 

thus (s(X, Y) = 0, so Y = X. Hence, all subsequence limits of (X n ) have the 

distribution of X and since the sequence is tight, this means that X n — > X. 
□ 
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We will later use an upper bound of Cs by the minimal L s -metric £ s ; 
see Zolotarev [30] for similar bounds. The l s metric, s > 0, is defined on all 
random variables X and Y with values in B and satisfying E||X|| S , E||Y|| S < 
oo by 

e s (X,Y):=£ s (C(X),£(Y)) 

:= inf{(E||X' - y'||*)(V«)Ai . C (x') = C{X),C{Y') = C{Y)}. 

Lemma 5.7. For all C(X),C(Y) £ V S , Z (B) and s>l, we have 
(5.5) C(X, Y) < {{EWXW 8 ) 1 - 1 / 3 + (E||y W'f-^isiX, Y). 

For < s < 1, we have 

( S (X,Y)<£ S (X,Y). 
Proof. For s > 1 and arbitrary f £ J- s , we define 

g(x) := f(x) - /(0) - Df(0)(x) —D m f(0){x, ...,x). 

ml 

Thus, we have 

5(0) = Dg(0) = ■■■ = D m g(0) = 0, \\D m g{x) - D m g{y)\\ < \\x - y\\ a . 
This implies, by backward induction on k, that 

\\D k g(x)\\ < \\x\\ s - k , 0<k<m. 
Thus, with Z := Y — X, we obtain, for an appropriate < i? < 1, 

\g(Y) - g(X)\ = \g(X + Z) - g(X)\ = \Dg(X + 0Z)(Z)\ 

< \\Dg(X + 0Z)\\\\Z\\ < \\X + $Z\\ S - 1 \\Z\\ 

< (lixii^ + nyf-^iizii. 

This implies, using Holder's inequality, that 

|E[/(y) - f(X)]\ = \E[g(Y) - g(X)]\ < E[(\\X\r l + 1 1 ^ 1 1 — 1 ) 1 1 ^ 1 1 ] 

< ((e\\x\\ s ) i - i/s + (E||yin i ~ i/s )(E||zin i/s . 

Taking the supremum over all / £ J- s and the infimum over all realizations 
of C(X) and C(Y), we obtain (5.5). 

For < s < 1, we have \E[f(Y) - f(X)}\ < E\f(Y) - f(X)\ < \\X - Y\\ s . 
This implies that ( s {X, Y) < l s (X, Y) . □ 
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6. Contraction method for Hilbert spaces. In this section, we extend the 
contraction method as developed for the Zolotarev metric on R rf in [27] to 
random variables in a separable Hilbert space H. We denote by V(H) the 
set of all probability distributions on H. The limit distributions occurring 
subsequently are characterized as fixed points of the maps 

T:V{H) ^V(H), 

(fU) 

c(z)^clpA;(z^) + b*\ 

where m > 1 is an integer, A*,. . . , A* m are random linear operators in H, b* is 
a random variable in H and (A*, . . . , A^, &*), Z^ l \ . . . , are independent, 
with £(ZW)=£(Z) for r = l,...,m. 

We write ||-A|| p :=su P||x|=i for a linear operator A in H. We say 

that A* is s-integrable if E 

V S {H) and T^o :=V Sj o(H), as defined in Section 5. 



j4*Ho P < oo. Furthermore, we abbreviate V s 



Lemma 6.1. Let (A\, . . . ,A* m) b*) be as in (6.1) and s-integrable for some 
< s < 2. For < s < 1, we have T(V S ) <^V S - For 1 < s < 2 and E6* = 0, 
we have T(V s fi) C 7\o- 

Proof. The existence of moments of order s of T(C{Z)) with E||Z|| S 
follows by the independence of A* and Z^ r \ For 1 < s < 2 and E6* = 0, we 
obtain that T(C{Z)) is centered. □ 

Lemma 6.2. Let (A*, . . . , A^, b*) be as in (6.1) and s-integrable for some 
< s < 2. Assume that 

m 

(6.2) E£PX P <1- 

r=l 

//0 < s < 1, i/ien the restriction ofT to V s is a strict contraction. Ifl < s < 2 
and E6* = 0, then the restriction ofT to V s ,o is a strict contraction. 

Proof. This is similar to the proof of Lemma 3.1 in [27]. Note that for 
a linear operator A in H and C(X),£(Y) G V s>z , we have ( S (A(X), A(Y)) < 
\\A\\ s op ( s (X,Y) (cf. Zolotarev [31], Theorem 3). □ 

Lemma 6.2 and Theorem 5.1 imply that the restrictions of T in Lemma 
6.2 have unique fixed points in V s and Vs^, respectively. 

We consider a sequence (X n ) n >Q of random variables in H satisfying the 
recurrence 

m 

(6.3) ^«=E4 B) (4)+ 6(n) . n ^ n °> 

r=l 
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where uq > 1, Ar are random linear operators in H, M n ) is a random 
var iable in H, /(») = (/{ n) , . . . ,/£°) is a vector of random integers with 
1^ G {0, . . . , n} , we have that (X^ ),..., ) , (A^ , . . . , , &(») , /(») ) 

are independent and C{X^p) = C(Xj) for all r and j. We then have the 
following extension of Theorem 4.1 in [27]. 

Theorem 6.1. Let (X n ) be as in (6.3) with all quantities there being 
s-integrable for some < s < 2. For 1 < s < 2, assume that EX n = for all 
n>0. Assume that, for appropriately chosen (A\, . . . , A* m , b*), we have 

(6.4) KWA^ -A* r \\ A op ^0, E||&( n ) -&*f->0, 

m 

(6.5) e£kiiSp<i, 

r=l 

(6-6) E[l {J (»)^ } ll4 n) ||Sp]-^0, 

/or all £gN and r = 1, . . . , m. M^e i/ien /iaue 

(6.7) C(^ n ,X)^0, twoo, 

where C{X) is the unique fixed point of T in V s for < s < 1 and in V s ,o 
for 1 < s < 2. 



Proof. The proof of Theorem 4.1 in [27] can be directly extended. For 
1< s < 2, from EX n = and (6.3), we obtain Eb^ = 0, thus (6.4) implies 
E6* = 0. Therefore, Lemma 6.2 implies existence and uniqueness of the fixed 
point of T, as claimed in the theorem for all < s < 2. 

We introduce the accompanying sequence 

m 

(6.8) g n 4^4 n) (^ {r) )+^ (n) , n>n , 

r=l 

where (A^ , . . . , A%> , , ) , I« , . . . , XW , ) , . • . , {xt ] ) are inde- 
pendent, with £{X^) = C{X) for r = 1, . . . ,m. We obtain that E||Q n || s < oo 
and, for 1 < s < 2, that EQ n = 0. Thus, the Cs distances between X n , Q n and 
X are finite for n > no- We obtain, from the triangle inequality, that 

(6.9) Cs(X n ,X) < Cs(X n ,Q n )+Cs(Qn,X). 

First, we show that Cs{Qn,X) — ► 0. For this, note that we have E||X|| S < oo 
and sup n>n E||Q n || s < oo by representation (6.8), independence and (6.4). 
Hence, Lemma 5.7 implies that Cs(Qn, X) < C£ s (Q n ,X), with some constant 
C that is independent of n. For a random variable Y in H, we denote the 
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L s -norm of Y by \\Y\\ S := (E||y || s ) 1 / s . Using (6.4), we obtain from X = 
J2A* r X^ + b* that 

sAl 



,(Q n ,x) < 



E(^-4 n) )(^ {r) : 



r=l 



(n) _ ^* 



sAl 



< j2\\\\A;-A^^\ jx\\\\i A1 + \\bW-bX A1 

r=l 

— ► 0, n — ► oo. 



Next, we bound the first summand in (6.9). Let T n denote the joint distribu- 
tion of {A^\. . .,A^l\b^ n \l^) and write a = (a 1: . ..,a m ), j = (ji, . . . , j m ). 
We then obtain, for n>no, 

Cs(X n , Q n ) 

(in m \ 

E 4 n) ) + ^ (n) , E 4 n) (* (r) ) + & {n) 

r=l r r=l / 

// m m \ 

C s E^( X i r) )'E^(^ W ) dT n(«,/3,j) 

\r=l r=l / 

< jf^\\a r \\ s op UXjr,X)dT n ( a ,(3,j) 



(6.10) 



r=l 



< eeii4 



n) us 



op 



r=l 



max ( s (Xj,X). 

0<j<n 



Thus, using (6.9), it follows that 



C s (x n ,x)< eEII^I 



op 



r=l 



max C s (X J ,X)+o(l). 

0<j<n 



We let d n := Q s (X n ,X). We show that the sequence (d n ) is bounded. With 

r n ■= ( s (Qn,X), :=EJ2^=i ||4 n) ||o P , d* n : = max <j<„(ij and i? := sup n > nQ r n , 
we obtain from (6.9) and (6.10) that 



(6.11) 



d n <i n d n + r n , n>n . 



By (6.4) and (6.5), there exist £ < 1 and m > no such that £ n < £ for all 
n>n\. Let B := + i?/(l - £). We claim that d* n < B for all n > 0. If this 
fails to hold, then for some n, we have < B < d n = d n V d n _ 1 . Hence, 
d n = d* n > B. Moreover, we have n > n\. Hence, (6.11) yields d n < £d n + R, 
thus d n < i?/(l — £) < B, contradicting our assumption. Consequently, we 
have d* < B for all n > 0. 



22 M. DRMOTA, S. JANSON AND R. NEININGER 

Let r\ := limsup n _ +00 d n and let e > be arbitrary. There exists an I 6 N 
with d n < f] + e for all n >£. We deduce, using (6.10), (6.9), (6.6) and (6.4), 
that 

/m 
]C 1 {v<jr<t}\\ a r\\ S o P Cs{X jr , X) dT n (a, ft, j) 

r=l 
„ m 

+ / 1 {^<><n}ll«r||opCs(^>,^)^n(a,^,j) + ^„ 
r=l 

m m 

< E E( V>< n ll4 n) llop) + + ll4 n) llop + r n 

r=l r=l 

<^e£ k||SpJ + + o(l). 
With n — ► oo , we obtain 

Since e > is arbitrary and E^^Li ||^4*llop < 1) we obtain 77 = 0. Hence, 

C s (x n ,x)^o. □ 

Remark 6.1. Note that the conditions (6.4) on the L s -convergence of 
the coefficients could be replaced by the joint ^-convergence 

(4t.,4»))-^(A; 41*). 

Remark 6.2. We assume s < 2 in the theorem above, unlike the finite- 
dimensional theorem in [27], where s < 3. The reason is that for 2 < s < 3, 
we need to normalize the second moments, which is not generally possible 
in an infinite-dimensional Hilbert space. 

7. Contraction method for analytic functions. In this section, we link the 
general contraction theorem of Section 6 to recurrences of random analytic 
functions which will be needed subsequently. Suppose that (X n ) n >o is a 
sequence of random analytic functions in a domain DCC such that, for 
some no > 1, 

m 

(7.1) X n ± Y14 n) - +b (n) , n>n , 

r=l 

where , . . . , A$ and are random analytic functions in D and 1^ = 
(II , . . . , Im ) is a vector of random integers with 4"' 6 {0, . . . , n} . Further- 
more, C(Xj T ^) = C(Xj) for all r and j, and we have that (A^ 1 , . . . , Am \ 
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/(»)), (xf ) ),...,{xf l) ) are independent. Note that the dot in (7.1) is 
meant as a pointwise complex multiplication. Hence, (7.1) is a special case 
of recurrence (6.3) with the random variables being analytic functions. The 
maps corresponding to T in (6.1) now have, for some domain D, the form 

T:V(H(D))^V(H(D)), 

(7 ' 2) 

C{Z)^C\Y^A*-Z^ + b*\ 

where A\, ...,A^ and b* are random analytic functions in D, C(Z^) = 
C{Z) for r = l,...,m, and (A\, . . . , A* m ,b*), Z^\...,Z^ are independent. 

We say that a random function is locally bounded in L s if the function 
zi-^E|H(z)| s is locally bounded. 

Theorem 6.1 then implies the following theorem. 

Theorem 7.1. Let < s < 2. Let (X n ) be as in (7.1) with X n , A^ , 
&( n ) being analytic functions of z € D locally bounded in L s . For 1 < s < 2, 
assume that KX n (z) = for all n>0 and z £ D. Assume that A\, . . . , A* m 
and b* are random analytic functions in D and that ACO is a connected 
subset such that for each x £ A, there exists a neighborhood U x C D of x 
and a number s(x) < s such that 



(7.3) 



su V E\Ai n \z)-A* r (z)\ s{x) ^0, 

z£U x 

supE|6W(z)-6*(z)| s{a;) ^0, 

z&J x 



(7.4) su V ¥,\A* r {z)\ s{ - x) <oo, 

zeu x 

m 

(7.5) ^Y,\ A *( X )\ S{X) <!» 

r=l 

(7.6) supE[l (n) |4 n) W| s(x) ] 



0, 



/or alll£N and r = 1, . . . , m. There then exists a domain D Q D with ACD 
and a random analytic function X in D such that, as n— > oo, 

(7.7) X n ^X inH(D), 

where C{X) is a fixed point of the map T from (7.2). Moreover, for each 
x € A there exists a neighborhood U x such that Esup 2g jj |X(z)| s ^) < oo 

and, if s(x) > 1, also EX(z) = 0, z £ll x . C{X) is the only fixed point of T 
with this property [even for a single x £ A and even with the integrability 
condition weakened to sup 2g ^ E|A"(z)| s ^) < oo]. 
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Proof. We begin by showing that the assumptions hold locally uni- 
formly in the following sense. Each x G A has a neighborhood U' x = B(x, g x ) Q 
D with g x > such that 



Esup \A^ l \z)-A* T {z)\ s{x) ^Q, 



(7.8) 



Esup \b in) (z)-b*(z)\ s(x) ^0, 



m 



(7.9) sup \A*{z)\ s ^ <1, 



1 w sup 



(7.10) E 
To show this, we use the following simple lemma. 



Lemma 7.1. If ^ is a random analytic function in a disc B(w,r), then, 
for some constant C > and every s > 0, 

E sup \^(z)\ s <C sup E|^(z)| s . 

z£B(w,r/2) z€B(w,r) 

Proof. Let 7 be the circle \z — w\ = 3r/4, and let I7I = 3irr/2 be its cir- 
cumference. Since (^(z)) 5 is subharmonic, standard properties of the Poisson 
kernel yield 

sup \^(z)\ s <C\~i\~ l f \^{z)\ s dz. 

z£B(w,r/2) J~1 

Taking expectations on both sides implies the assertion. □ 

Now, by assumption, for every x € A, there exists a disc B(x, g x ) such that 
(7.3) holds uniformly in B(x, g x ). Lemma 7.1 now shows that (7.8) holds with 
U' x = B{x, q x /2). Similarly, (7.6) and Lemma 7.1 applied to 1, (») x Ar 

yield (7.10). For (7.9), we first note that (7.4) and Lemma 7.1 imply that, 
for each r = 1, . . . , m, 

E sup \A* r (z)\ s(x) <oo 

z£B(x,q x ) 

for a suitable g x > 0. Hence, dominated convergence shows that, as g J, 0, 
since A* is continuous, 

E sup \A;{z)\ s W ^E\A* r {x)\< x \ 

\z—x\<Q 
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Summing over r = 1, . . . , m and using (7.5), we see that (7.9) holds for some 

Consider now a disc U' x such that (7.8)-(7.10) hold. We may assume 
that U' x C D. We let H be the Bergman space B{U' X ) and regard (the re- 
striction of) X n and b^ as random elements of H, while the Ar are re- 
garded as random pointwise multiplication operators / 1— > Ar ■ /• Clearly, 

1 1 Ar 1 1 op ^ 

sup 2€f/ / Note that Lemma 7.1 implies that E||X n || s < 

oo, E||6( n )|| s <oo, and E||^ n) ||^ p < oo. 

By (7.8)-(7.10), Theorem 6.1 applies in H [with s replaced by s{x)\ and 
shows that there exists a random analytic function Z x G H = B(U' X ) such 

that X n Z x in B(U X ), and thus in 7i(U x ), with C(Z X ) being the unique 
fixed point of T from (7.2), with D = U' x , in V s{x) {B{U' x )) for < s(x) < 1 and 
in V s ( x )fi{B(U x )) for 1 < s(x) < 2. The result now follows from Theorem 4.1 

with D = U^gA U' x . Note that the condition E||X||g^,j = E||Z x ||g^/\ < oo, 
by Holder's inequality, implies that 

E \X(z)\< x Um(z)=E\\X\\ s £l Hu , ) < C(x)M\\X\\ ^ < co 

and the fact that \X (z)\ s ^ is subharmonic implies that for any strictly 
smaller disc U' x ' C U' x , we have sup 2eC7 „ \X(z)\ s ^ < C'(x) fo, \X(z)\ s ^ dm(z). 
Hence, 

(7.11) E sup \X(z)\ s ^ < oo 
and thus 

(7.12) sup E|X(z)| s ^ <oo. 

zeu'j 

Conversely, (7.11) implies that E||X||g^„j < oo and (7.12) implies (7.11) in 
a smaller disc, by Lemma 7.1, so any of these, together with MX(z) = in 
a neighborhood of x when s(x) > 1, yields uniqueness of C(X\q ) for some 

neighborhood U x , and thus uniqueness of C(X). □ 

Remark 7.1. Theorem 7.1 and its proof immediately extend to (finite- 
dimensional) vector-valued functions X n and and matrix-valued 

functions A r n ' . Condition (7.5) then becomes E^ r 1 1 -<4* 0*0 1 1 op < 1- 

8. The expected profile. The purpose of this section is to discuss in 
detail the first (and second) moment of the profile polynomials W n {z) = 
Y^k>o-^-n,k zk and the expected value KX n ^. We already know that for fixed 
z E D\ (recall the definition of the set D s in Section 3), the expected profile 
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polynomial EW n (z) behaves as E(z)n Xl ^ 1 . In order to get more precise 
results, we need more information on Xi(z). 

Lemma 8.1. Suppose that s > —t. D s is then an open domain in the 
complex plane that contains the interval (F(s), oo) and X\(z) is analytic in 
D s . 

Furthermore, if z G D s , then all z'gC with \z'\ = \z\ and |arg(z')| < 
| arg(z)| are also contained in D s . Moreover, the mapping ip t— > $t(\i(\z\e lip )) 
is strictly decreasing for < ip < | arg(z)|. In particular, 9?(Ai(z)) < Ai(|^|) ; 
where equality holds if and only if z is real and positive. 

Proof. Recall that \\(z) is the root of F(X) = z with largest real part. 
Further, note that F is a polynomial of degree d = (m — l)(t + 1). For the 
sake of brevity, we will only discuss the case d > 2. The three cases m = 2, 
t = (where d = 1), m = 3, i = (where d = 2) and m = 2, i = 1 (where 
d = 2) can be treated separately (and are, in fact, very easy). 

We will first describe the "inverse map." For this purpose, we consider 
the mapping 

r i ► F(cr + ir) (r G M) 

for fixed a > 0. Since F(a — ir) = F(a + ir), it is sufficient to consider r > 0. 
By definition (3.5), it is clear that the argument arg(i ? (<T + ir)) and the 
modulus \F(a + ir)| are strictly increasing functions for r > and that 
we have lim r _ KX) arg(i ? (cr + ir)) = dir/2. Hence, if d > 2, then there ex- 
ists a unique minimal tq = tq(u) > with arg(F(<r + iro)) = it. Note that 
the mapping a *— > \F{a + ir)| is also strictly increasing, but the mapping 
a i — ► arg(i ? (cj + ir)) is strictly decreasing, for a > and fixed r > 0. This 
implies that the curves 7^7 := {F(a + ir) : < r < ro(cr)}, a > 0, are all dis- 
joint and that the mapping a ^ ro(cr) is strictly increasing. Further, the 
mapping a i— > ro(cx) is continuous. It also follows that we can parametrize 
7+ as {r a {cf))e 1 ^ : < (f> < 7r} for some strictly increasing continuous function 
r a on [0,7r]. 

The curve 7 ff := {F(a + ir) : -t (<t) < r < r (a)} = {r CT (|^|)e^ : -tt < ^ < 
7r} is a simple closed curve that is the boundary of a compact set K a = 
{re^-.r G [0, r CT (|(/>|)), <fi G [— vr,7r]}, that is, the union of 7 CT and its interior. 
Our next goal is to show that D s = C \ (if s U L s ), where L s := (— oo, F(s + 
iro(s))] is a half-line. For this purpose, consider the set 

Z s := {A G C : K(A) > s, -r (K(A)) < 9(A) < r (K(A))}. 

Suppose that A£Z S . Then F(A) G 7j{(a) and thus -F(A) ^ 7 S . Moreover, F(X) 
can be connected to oo by a path disjoint from ^y s (e.g., a piece of 7sr(a) plus 
the half-line [.F(3i(A)),oo)) and thus F(X) belongs to the exterior of 7 S , that 
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is, F(X) ^K s . Since A G Z s further implies that |arg(F(A))| < it, we also 
have F(X) <£ L s . Consequently, F : Z s -> C \ (K s U L s ). 

Since the curves r y (T , a > s, are disjoint and simple, F is injective on Z s . 
Furthermore, to see that F maps Z s onto C\ (K s UL S ), suppose the contrary. 
Since C \ (K s U L s ) is connected, there would be some z G C \ (K s U L s ) 
such that z G F(Z S ) \ F(Z S ). Thus, there would exist a sequence A n G Z s 
such that F(X n ) — > z. This implies that the sequence (A n ) is bounded and 
there thus exists a subsequence converging to some A G Z s . By continuity, 
F(A) = z, and since z ^ F(Z S ), this implies that X^ Z s and thus that A G 
dZ s . But, if A G <9Z S , then either F(X) £-f s cK s or 9(A) = ±r (K(A)) and 
F(A) G L s . Both cases contradict F(A) = z G C \ (A s U Consequently, 
F : Z s — > C \ (K s UL S ) is a bijection. 

Now, let z G C\ (A s U L s ). We will show that z £ D s . We have just shown 
that then there exists 9 £ Z s with F(9) = z. By symmetry, we can assume 
that ^s(9) > 0. By the monotonicity properties of \F(a + i-r) | , it follows that 
\F(a + i-r) | > |;z| if a > $1(0) and |r| > 9(0). Further, if a > K(6>) and |r| < 
9(0), then (T + ir G Z s , so F(a + it) ^ F(9) = z unless a + ir = 9. Hence, 
F(X) = z has no other root with 31(A) > 9?(0). Moreover, F'(9) / (e.g., 
because F is a bijection on Z s ) and thus 9 is a simple root of F(X) = z. 
Consequently, 9 = X\(z) and 5R(Ai(z)) > $i(X2(z)), which implies that z G 
D s . 

Similarly, if 9 = s + ir' with \r'\ < r (s), then \F(a + ir)| > F(6») if a > s 
and |r| > |r'|, while if a > s and |r| < |r'|, then | arg(F((T + ir))| < | arg(i ? (0))|. 
Hence, if a > s and r G M, then F(a + ir) ^ 7 S . Since the half-plane {a + \t : 
a > s} is connected, it is thus mapped by F into the exterior of %, that is, 
into C \ K s . Consequently, if z G K s , then -F(A) = z has no root with 5RA > s 
and thus z ^ D s . Finally, if z G then z = F(a ± iro(o")) for some a > s. 
Thus, the fact that z G 7 CT and the argument just given together imply that 
F(#) = £ has no root with K(#) > a. Hence, 3?(Ai(z)) = <r, but there are two 
such roots, a ± iro(o"), so 3?(Ai(z)) = K(A2(-z)) and z ^ D s . 

We have shown that 

(8.1) D s = C\ (A' s U L s ) = {re^ : r > r s (|^|), -vr < <f> < vr} 

and that the inverse mapping F~ l : C \ (K s U L s ) — > Z s explicitly computes 
Xi(z) = i ?_1 (z), which is a simple root. Note, too, that X±(z) (for z G D s ) is 
characterized by the property that it has smallest absolute imaginary part 
among all solutions of F(X) = z with K(A) > s. By the implicit function 
theorem, X\(z) is analytic in D s . 

Since F(s) is the only boundary point of D s on the positive real line, it 
follows that D s contains the interval (F(s), oo). [Alternatively, use (8.1).] 

Finally, (8.1) and the fact that r s is strictly increasing on [0, it] together 
imply that if re 1 ^ G D s and \(j)'\ < \<f>\ < ir, then r > r s (\<f)\) > r s (\cj)'\) and 
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re^' G D s . Moreover, if a := sft(Ai(re^)), then re^ G dD a = j a U L CT , and 
thus re 1 ^ G 70-. The same argument shows that r = r a (\(f)\) > r a (\(f)'\) and 
rf& G At- Hence, K(Ai(re^')) > a = K(Ai(re^)) and the final statement in 
the lemma follows. □ 

The next step is an extension of Lemma 3.1. Note that 5i(Ai(z)) is well 
defined for all z G C. (Estimates involving logn are only supposed to hold 
for n > 2.) 

Lemma 8.2. Let W n (z) = J2k>oX n ^z k denote the (random) profile poly- 
nomials. 

(i) If K is a compact subset of D±, then there exists 5 > such that 

(8.2) EW n {z) = n Xliz) -\E{z) + 0{n~ 6 )) 

uniformly for z G K. 

(ii) if K is a compact subset ofC, then there exists D>0 such that 

(8.3) \KW n (z)\ <n max { R ( A i(*))- 1 '°}(logn) IJ 
uniformly for z G K. 

Proof. The proof is a direct extension of the results of [9] applied to 
the recurrence relation (3.3). In particular, we have to take care of the uni- 
formity in z G K. This can be done by a careful inspection of the proof in 
[9]; see the Appendix. □ 

With the help of Lemma 8.2, we directly obtain bivariate asymptotic 
expansions for EX,,^ in a large range. It turns out that one must solve the 
equation 

(8.4) 0\' 1 (P) = a. 
From (3.5), it follows that 

(3\[(P) = F(\ 1 ((3))/F'(\ 1 ((3)) 

(8.5) 

_ / 1 1 1 

~ Ui(/?)+t + Ai(/?)+i + l + " ' + Ai(/?) + (i + l)m-2 

Note that this formula also shows that the mapping (5 \— > [3\'i {(3) is strictly 
increasing because \\{(3) is strictly increasing for (3 > 0. Moreover, Ai(/3) 
increases from —t to 00 for < (3 < 00 and it follows that (3\' l {(3) increases 
from to 00. Hence, (8.4) has a unique solution (3{a) > for every a > 0, 
with /3(a) strictly increasing. Since we must assume that Ai > 1, we note 
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that (8.4) has a proper solution with Ai(/3) > 1, and thus [3 G D\, if and only 
if 

/ 1 1 1 \ - 1 

a > a : = - — 7 + T — ~ H h 



i + 1 t + 2 (t + l)m-l 

Lemma 8.3. Suppose that ol\^oli with ao < ct\ < 012 < 00 are given and 
let (3(a) be defined by (3(a)\'i((3(a)) =a. Then 

E(8(a 1 ))n Xl W a n,k))-an,,k log(/3(a nifc ))-l 

EA nifc = _ (l + 0((logn) 7 )) 

y 2vr(a nifc + /5(a riifc ) 2 A / 1 / (/3(a nifc ))) logn 

uniformly for a n> k = k/ log n G [ai , 02] as n,fe->oo. 

Proof. By Cauchy's formula, we have 

EI„, fc = ^- f EW n (z)z~ k ~ l dz. 
2m J\z\=i3 

Note that EW n (z)z~ k behaves as 

(8.6) EW n (z)z~ k ~ E(z)n Xl ^- 1 z- k = I^( z ) e A 1 «logn-felo gz _ 

n 

In order to evaluate the above Cauchy integral, we use a standard saddle 
point method. The saddle point of the function z 1— > Ai(z)logn — /clog z is 
given by z = /?, which satisfies p\[((3) = fc/logra, that is, by /3(fc/logn). 

By construction, the real interval \fl(ai), f3(ot2)] is contained in Z?i. Hence, 
there exists 7 > such that the set {z G C : \z\ G [(3 (ai), f3 (02)], | arg(z)| < 7} 
is also contained in D\. 

Let K = {z G C: \z\ G [/3(q;i),/3(q!2)],7 < | arg(z)| < tt}. By Lemma 8.1, 
there then exists 77 > such that for all (3 G \(3(a\) , (3(a2)\, 

max max{sft(Ai(2r)),l}< Ai(/3) — 77. 

zeK,\z\=/3 

[Uniformity follows from the continuity of Ai(z).] Hence, by Lemma 8.2, 

\EW n {z)z- k ~ 1 \ dz < n MK^))-i-v/2-a\og(i3(a)) ^ 



L 



|*|=A|arg(*)|>7 

where a = k/logn and (3 = (3(a). Thus, this part of the integral is negligible. 

For the remaining integral (leading to the asymptotic leading term), we 
use the substitution z = j3e lt (\t\ < 7) and the approximation 

(8.7) Ai (z) log n — k log z = (\\([3) — a log (3) log n 

(8.8) + i((Ai'C9) + a/3" 2 ) logn)(z - (3) 2 

(8.9) +0(logn\z- p\ 3 ) 
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to obtain the final form after standard saddle point algebra. □ 

In what follows, we will also need estimates for the second moments of 
W n (z). 

Lemma 8.4. For every compact set KCC, we have 

(8.10) E|^ n (z)| 2 = O( n max{A 1 (|,P)-l,2K(A 1 (,))-2,0} (logn) D' ) 

uniformly for z G K , where D' > is an absolute constant. 
Proof. We use (3.2) twice, for z and z, and obtain 
\WUz)\ 2 ^\z\\\W^(z)\ 2 + ... + \W^Jz)\ 2 ) 

(8.H) +M 2 E 

+ 2(m -m(jt zW$ d (z)J + (m - l) 2 . 

We take the expectation. By Lemma 8.2, [EW^(z)| = 0(A n (z)), uniformly 
for z G K, where A n (z) := n max W Al W)- 1 ' >(logn) IJ (for some fixed D>0) 
for n>2, and Aq(z) := ^2(2) '■= 1, say. Hence, for / < n, and uniformly in 
zGiC, 

E(WV„,x (*) I K,i = = O(Mz)) = 0(A n (z)) 

and thus 

E(zW Vn Jz) I F n ,i = Z) = 0|E(WV nil («) I V n ,i = l)\=0(A n {z)) = 0{A n {zf). 
Similarly, for l\ + I2 < n, and uniformly m z € K, 

E(W Vna (z)W Vn Jz) I y n>1 = h,V n , 2 = h) = 0(A h (z)A l2 (z)) = 0(A n (zf). 
Consequently, uniformly for x G K, (8.11) yields, using (2.2), 

(8.12) E\W n (z)\ 2 = m\z\ 2 E\W Vn ^z)\ 2 + 0{A n {z) 2 ) 

n-l n-e-1 \ 

(8.13) = m\ z \ 2 x; VfA r (m ~y +T ?~ 2; E|w £ ^)i 2 

£=0 Vmi+m— 1/ 

(8.14) +0(n max ^ Al W)" 2 '°>(logn) 2D ). 

This is an equation of the same type as (3.3) and we can again apply [9] 
to obtain the stated estimate. As in the proof of Lemma 8.2, an inspection 
of [9] shows that the estimate holds uniformly in z; see the Appendix for 
details. □ 
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Remark 8.1. A special case of this result for m = 2 and t = has been 
proven in [4]. In this case, we have \\(z) = 2z and obtain (for some D > 0; 
in fact, D = 2 will suffice for all z and D = 1 or will suffice for all z ^ 1/2) 

^\W n {z)\ 2 < n max{4^-2,0} (logn )D Q z _ y < 

and 

^\W n {z)\ 2 < n^M^-^ilogn) (\z - 1| > l/y/2). 

Remark 8.2. The method of Lemma 8.4 can be used for many other 
junctionals of W n (z). For example, the expected derivative KW^(z) satisfies 
the recurrence 

n-l (A ( n-i-l \ 

EW'(z) =mzY: f E^(z) 



£=0 \mt+m- 



-l) 



n-l (*)( n-i-l \ 

w Mm— l)t+m— 2/ / 

+ m f n N ®W e (z). 

£=0 \mt+m— 11 

For simplicity, let z G Z?i be real and nonnegative. Then, from 
n-l ( e ) ( n-i-l \ 
m £ W ; (m -y + "" 2j E^(z) = -(EW n (z) - (m - 1)) = O^W" 1 ) 

and an application of [9], we obtain 

E<(2) = 0(?i Al(2) - 1 logn). 
[This also follows from (8.2) by Cauchy's estimates.] 

We close this section with a proof that the sets / and I' (defined in 
Theorem 1.1) are in fact intervals. 

Lemma 8.5. Let /:={/?> : 1< Ai(/? 2 ) < 2A a (/3) - 1} and /' := {pX^ (/?) : 
/?€/}. TTien I and V are open intervals that are contained in the positive 
real line. More precisely, 1 € JC (^,/3(a+)) and a max G I' C. (ao,a+). 

Proof. Since Ai(z) is increasing for z > 0, it is clear that ii := {/3 > 
: 1< Ai(/? 2 )} is an interval. We show that J 2 := {/3 > : Ai(/? 2 ) < 2Ai(/3) - 
1} is also an interval, which implies that I = I± H I2 is an interval. 

Suppose that > and that A = Ai(/3) with Ai(/? 2 ) = 2Ai(/?) - 1. Then 
F(2A-1) = F(Ai(/3 2 )) =I3 2 = F{\) 2 . However, for A > — (t - l)/2, we have 

1 mt+m—2 / 9 9 \ 

-(l„ g F(2A-l)-2 1 „gF(A))= £ ( S _ T ^ ), 
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which is greater than for A < 1 and less than for A > 1. Thus, q(X) : = 
F(2A — 1)/F(\) 2 is strictly increasing on [(1 — 1)/2, 1] and strictly decreasing 
on [l,oo). Moreover, g((l - t)/2) = 0, g(l) = 1/F(1) = m > 1 and g(A) -> 
as A —> oo. Consequently, there are exactly two roots, AJ < A2, of F(2X — 1) = 
F(A) 2 in [(l-t)/2,oo) and two roots (3* = F(X*) > of Ai(/? 2 ) = 2Ai(/3) - 1. 
[Note that 2X 1 (/3) - 1 = Ai(/? 2 ) > -t implies that Ai(/3) > (1 - t)/2] Since 
Ai(l) = 2, it is easily seen that Ai(/? 2 ) < 2Ai(/3) - 1 (for /? > 0) if and only 
if &*</?< (3* 2 _ 

Set I = (J3, /?). Since A(l) = 2, we surely have lei. Next, note that (5 = ^- 
corresponds to Ai(/3) = 1. Thus, Ai(l/m 2 ) < Ai(l/m) = 1, which implies that 
^<£- 

In order to prove /?+ := (3(a+) > 0, it suffices to show that Ai(/3 2 _) > 
2Ai(/?+) - 1 or, equivalents, that F(2Ai(/3+) - 1) < F(X 1 ((3+)) 2 [since 
F(Ai(/?+)) 2 = Pi = F(Ai(/3 2 ))]. First, by definition, 

mf+m— 2 1 

logF(A 1 (/3+)) = (A 1 (/3 + )-l) TT^TX^- 

Moreover, with S + := ES +m_2 (^i(/ 3 +) + «)~\ it follows by a convexity ar- 
gument (compare with [5], Lemma 3.2) that, for every A > 1, 

logF(A)<(A-l)5 + , 

with equality only for A = A 1 (/?+). Consequently, 

logF(2AiG9 + ) - 1) < (2Ai( / 9+) - 2)5+ = 21ogF(Ai (/?+)). 

Thus, we have F(2Ai(/3 + ) - 1) < F(Ai(/3 + )) 2 and consequently (3(a + ) >J3. 

Finally, since the mapping /3 1— > /3X'i((3) is strictly increasing, by (8.5), 
and continuous, it also follows that /' is an interval [that is contained in 

(a , <*+)]■ D 

Remark 8.3. In Theorem 1 1 . 1 , we consider the set J = {/3 > : Ai (/5 2 ) < 
2Ai(/3) — 1} instead of I. This equals I2 in the proof above, so J is also an 
open interval. Furthermore, a slight extension of the above proof shows that 
J C (/3(a_),/3(aq_)). The proof shows that f3 + > sup J, and /3_ := 0{aJ) < 
inf J can be shown in exactly the same way. 



9. Proof of Theorem 1.2. In this section, we prove Theorem 1.2. 



Proof of Theorem 1.2. The sequence of random analytic functions 
(W n ) in Theorem 1.2 satisfies the recurrence (3.2). Hence, for 

-= w n (z)-mv n {z) = w n (z) _ 

n(Z) ' EW n {z) EW n {z) ' 
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we obtain, with G n (z) :=KW n (z), that 



in 



Hence, we have 



r=l 



with 4 n) = K,r, Ai n) = zG Vn Az)/Gn(z), 

b(n) = Gjz) ( m - 1 - G - W + Z E ^^n,r 

and conditions as in (7.1). We will see below that the sequence (X n ) of 
random analytic functions satisfies the conditions of Theorem 7.1 for all 
1< s < 2 with D = {z £ Z?i : E(z) / 0}, 

= z^iW" 1 , 6* = z^V^W- 1 - 1, 



r=l 

for r = 1, . . . , m, and A = (1/m, /?(«+)). 

Theorem 7.1 then implies that X n X in TC(D), where D is a complex 
neighborhood of the real interval (1/m, /3(aq_)) and C(X) is the fixed point of 
T defined in (7.2), with the integrability condition from Theorem 7.1. Recall 
that for x £ (1/m, f3(a+)), we have X±(x) > 1 and note that this convergence 
implies the assertion since W n (z) /EW n (z) = X n + 1. Hence, we have 

W n (z) d 



^W n (z) 

where 



Y(z) = X(z) + l inW(D), 



r=l 

with conditions as in (7.2), which is (3.7). The integrability condition on X 
is obviously equivalent to the same condition for Y and since, as we shall see 
below, we may take s(x) > 1 arbitrarily close to 1, the condition is equivalent 
to the existence, for each x £ I, of some s(x) > 1 such that E|y(z)| s ( x ) is 
finite and bounded in a neighborhood of x, as asserted in Section 1. 

It remains to verify conditions (7.3)-(7.6). Using Lemma 8.2, we obtain, 
uniformly in each compact subset of D, 

Ain) Gy n , r {z) _ V^ z) ~\E(z)+Q{V-f)) 
r Z G n {z) Z n x ^)~ 1 (E{z)+0(n~ s )) 

n 



(1 + 0(0)- 
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[V ntr may equal 0, but that is not a problem; the cautious reader may write 
(1 + K.r)" 5 above.] 

We have V n ^ r /n — ► V r by (3.6) and thus we may, by a suitable coupling, 
assume that V ntr /n — > V r a.s. (see also Remark 3.1). Since these random 
variables are bounded by 1 and $tAi(z) — 1 > in D, dominated conver- 
gence yields A^\z) — ► A*.(z) in L s for any s > 0. This also implies that 
b( n '(z) — > b*(z) in L s . Moreover, these L s -convergences are uniform in any 
compact subset of D and arbitrary s > 1. This establishes condition (7.3). 
For bounded neighborhoods U x and arbitrary 1 < s(x) < 2, we have that 
\A* r (z)\ s ^ is uniformly bounded in z £ U x . This implies conditions (7.4) 

and (7.6), since we have ¥(I^ n) < £) -> for all £ G N as I^ n) jn -> V r and 
F(V r = 0) = 0. 

For condition (7.5), note that V r has the Beta(t + 1, (m — l)(t + 1)) dis- 
tribution. This implies that, for a > 0, we have 

Eya = T(t + a + l)((t + l)m-iy. = 1 

r t\T(m(t + 1) + a) mF(a + 1) ' 

with F given in (3.5). Let x £ A = (l/m,/3(a+)). We have 

m s 

We have ^(1) = 1. Thus, the existence of an s(x) G (1, s) with (7.5) fol- 
lows from fi4(l) < 0. To verify a negative derivative, we consider h x (s) := 
— log(g x .(s)) = log(F(s(Ai(x) — 1) + 1) — slogx. We then have 

^(l) = (A 1 (x)-l)^(A 1 (x))-logx 

mt+m—2 \ i \ i 

- s 

From logx = log(i ? (Ai(x))) and (1.4), we obtain that the only zeros of h' x (l) 
are at x = F(\J) = /3(a_) and x = F(X + ) = (3(a + ). For x = 1, we obtain, 
with Ai(l) = 2, that ^(l) > 0, thus, by continuity of h' x (l), we obtain 
h' x (l) > for all /?(«_) < x < /3(o+). 

Thus, for all /3(a_) < x < /3(a + ), there exists s(x) G (1, s) such that 
g x (s(x)) < 1. In particular, this shows (7.5). We have verified the conditions 
of Theorem 7.1 and the proof is complete. □ 

10. Reduction to the profile. We now come back to the original prob- 
lem. We know (Theorem 1.2) that the profile polynomials W n (z) satisfy a 
functional limit theorem 

(10.1) (W n (z)/EW n (z),z (Y(z),z G B) 
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for some open domain BCC including the open interval (1/m, /3(a+)) C R 
and that (Y(z),z € B) is the process (of random analytic functions) that 
satisfies the stochastic fixed point equation (3.7) with 'EY(z) = 1 and a 
certain integrability condition. 

The idea is now to reconstruct X n ^ from the limit relation (10.1). In 
particular, we want to show that 

(10.2) G A -*+ (Y((3(a)),a e 1% 

where I' (defined in Theorem 1.1) has the property that all (3 = /3(a) (for 
a E I') satisfy 1 < Ai(/5 2 ) < 2Ai(/3) - 1. 

We use the Cauchy formula and split it into two parts. More precisely, 
we fix a compact interval I c C (1/m, f3(a+)) and a small > such that the 
compact set B\ := {z £ C : |z| £ I c , | arg(z)| < </?} is contained in I?. Further, 
let J£ = {/3Ai(/?):/3€ic} and 5 2 := {z £C: \z\ S I c ,ip< \ arg(z)| < vr}. 

For a £ I' c , so that /3(a) S I c , we write 

X n , [alognl f W n (z)z-^°^~ l dz 

,L 6 J ITTI J\z\=l3(a),zeBi 

(10.3) 

+ — / W„(z)z- La]osn J- 1 tte. 



27ri J| z | =/ 3( Q )^ gB2 

We study the two integrals separately. For the first part, we define linear 
operators T n , mapping the space C(B\) of continuous functions on B\ into 
the space D(I' C ) of right-continuous functions with left limits on I' c , by 

, l/(27r»)/ k|=/ , (a)|ZeBl G(z)EW w (z)2 ! -L^"J-idz 
_Z n (GJ(a) — — — , 

^^-n, j a logn I 

(10.4) 

a G /£. 

Note that if we take G(z) = W n (z) /EW n (z) , the numerator in (10.4) equals 
the first term on the right-hand side of (10.3). The second term will be shown 
to be small and thus T n (W n (z) /KW n (z)) is an approximation of 

(^n,Lalognj/^-^n,L<*lognJ > a e ^c)- 

We begin by studying T n in Lemma 10.1. We will then, in Lemma 10.2, 
show that the second term of (10.3) is sufficiently small to be neglected and 
Theorem 1.1 will follow. 

We will use the supremum norm; for convenience, we write, from any set 

E, 



||/|| B :=sup|/|. 

E 
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Lemma 10.1. (i) The operators T n are uniformly continuous with re- 
spect to the supremum norm. More precisely, there exists a constant C > 
(depending on I c and B\) such that 

\\T n (F)-T n (G)\\ r <C-\\F-G\\ Bl . 



(ii) If F n ^ F uniformly on B\, then T n {F n ) — > F uniformly on V c . 

Proof, (i) Suppose that ||F — G\\b 1 < $ and that a G I' c . Then 
\T n (F)(a)-T n (G)(a)\ 

(10.5) 

< — -!- / \EW n (z)z^ a ^- 1 \\dz\. 

^ X n,[alogn] ^ J\z\=f3(a),zeB 1 

First, suppose that k = alogra is an integer. Lemma 8.3 yields an estimate 
of KX n _k and its proof, in particular (8.6) and (8.7), yields an estimate of 
the same order for the integral in (10.5). Hence, (10.5) implies that 

(10.6) \T n (F)(a)-T n (G)(a)\<C5 

for some C, uniformly in a G l' c , such that alogn is an integer. 

For general a, we define a' = [a log n\ / log n and note that \oi — a\ < 
1/logn, so \f3(a') — /3(a) | = 0(l/logn). It is easily checked that if we replace 
a' by a in the estimate of X n ± in Lemma 8.3, then the result will change 
by at most a factor = e 0(l \ It follows that 

(10.7) EX niLalognJ > (lognrVV^))-" 10 ^)- 1 

and that (10.6) holds uniformly in all a G I' c , possibly with a larger constant. 

For (ii), let F G C{B\). By using standard saddle point techniques, as in 
the proof of Lemma 8.3, we have 

lim -— -L f F(z)EW n (z)z^ alo ^~ 1 dz = F((3(a)), 

n^oo EA„ )Lalogn j 27T1 J|a|=/3( a ),zeBi 

that is, T n {F) — > F, uniformly on I' c . Finally, if F n — > F uniformly on B\, 
then T n (F n ) — T n (F) — > 0, by (i), and consequently T n (F n ) — ► F uniformly 
on V c . This completes the proof of the lemma. □ 

Next, we focus on the error 

X n,[alogn\ ( W n (z) 



a log n 

(10.8) 

y{2^)f\ A=p{a) ^ B2 W n {z)z-^^dz 
^n, [alognj 

where we recall that B2 = {z G C : \ z\ G I c , <p < \ arg(z)| < tt}. 
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Lemma 10.2. For every compact interval I c contained in I = {(3 > 0:1 < 
A 1 (/3 2 )<2A 1 (/3)-l}, 



sup 

as n — > oo . 
Proof. Let 



If we further define 
(10.9) 



EX, 



n, [a log raj 







\W n {z)z-\- aXo ^\ 



EX 



H n {z) :-- 



n, [a log raj 

W n (z) 



n Ai(|s|)-l' 

then it follows from (10.7) that, uniformly for a G I' c and z G Bi with \z\ = 

/3(a), 

(10.10) G n>a (z) <\H n {z)\{\ogn) l l 2 . 
Let z G i?2 and let 5 n = 1/logn. Since W n is analytic, 

(10.11) \ Wn {z)\ 2 < ^-r ( \W n (z)\ 2 dm(w), 

no* J\w-z\<S n 

where m is the two-dimensional Lebesgue measure. 

Further, if \w — z\ < S n , then n Al (l z D _1 and n Al ^ w ^ _1 differ by at most a 
constant factor. Hence, 



(10.12) 



\H n (z)\ 2 <(logn) 2 \H n {z)\ 2 dm(w). 
J \w—z\<S n 



Fix 5 > (we will be more precise below) and let I>2 = {w : dist(it?, B2) < 
5}. Then, by (10.12), if n is sufficiently large that 5 n < 5, we have 

(10.13) sup \H n (z)\ 2 < (logn) 2 [ \H n {z)\ 2 dm(z). 

zeB 2 JB\ 

We now use Lemma 8.4. By Lemma 8.1 and the assumption I c C /, 
max(Ai(|z| 2 ) - l,2»(Ai(*)) -2,0) < 2Ai(|z|) -2 

on i?2- By continuity, this also holds on B% if 5 > is small enough and, by 
Lemma 8.4 and compactness, there exists ij > such that 

E\W n (z)\ 2 <n 2Xl( -^~ 2 - r > 

uniformly for z G B%. Thus by (10.9), 



E\H n (z)\-^n 



<r,,~ r > 
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and by (10.13), 

e( sup \H n (z)\ 2 ) < (logn) 2 f K\H n (z)\ 2 dm(z) < (lognfn^. 

Hence, sup 2eBa \ogn\H n {z)\ — — » and the result follows by (10.10). □ 

Proof of Theorem 1.1. By Theorem 1.2, W n (z)/EW n (z) y in 
7~C(B) and thus for every compact subset £> c in the space C(B C ) with the 
uniform topology. Hence, Lemma 10.1(h) and [2], Theorem 5.5, imply that 

T n {W n /mV n )^Y mD(l' c ). 
Finally, Lemma 10.2 and (10.8) imply that, provided I c C /, 

^n, La logn] / W n (z) \ p 

uniformly on I' c . The theorem then follows. □ 

Remark 10.1. The reason that we have to restrict ourselves to the 
interval I' (and cannot extend our result to a larger interval; compare with 
the discussion of the critical values in Section 1) is that we use an L 2 -estimate 
for | W n (,z)| in the proof of Lemma 10.2 that only works if a G I'. In fact, I' 
is the largest interval in which we have L2-convergence to the process Y{z). 

However, it is very likely that one can prove similar estimates for E| W n (z) \ p 
for any p > 1 and that our method of proof, using the version of (10.11) for 
pth powers, would then prove Theorem 1.1 for the largest possible interval 
(a ,a+). 

11. The external profile. In this final section, we will discuss a variation 
of Theorem 1.1 which concerns a similarly defined profile process. 

The external profile Y n ^ denotes the number of free positions at level k in 
a tree with n keys. A free position is a position where the (n + l)st key can 
be placed, for example, Y^o = m — 1 and Yq^ = for k > 1. More precisely, 
we have 

fill] v , -yW +Y (2) -l i yM 

jointly in k > for every n > m — 1, where the random vector V n = (V^ i, V n 2, 

. . . , V n ^ m ) is as in Section 2 and is the same for every k > 0, and = 

(^nfc)fc>0; 3 = 1, are independent copies of Y n that are also inde- 

pendent of V n . The initial conditions are Y n ^ = for n > m — 1, and 
for n < m — 2, we simply have Y H) q = m — 1 — n for < n < m — 2 and 
Y n ^ = for k > 1. However, we should mention that the initial conditions 
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for Yo,o> ^T,o> •••) ^m-2,o only affect implicit constants in our analysis. The 
limit theorem (Theorem 11.1) is not affected. For example, one can also use 
Y nj o = 1 for < n < m — 2 in order to count the number of nodes where one 
can place a new item. 

Let U n (z) =J2k^n,kZ k denote the random external profile polynomial. 
By (11.1), it is recursively given by U n {z) = m — 1 — n for < n < m — 2 and 

(11.2) U n (z)^zU^ A (z)+zU^ 2 (z) + --- + zU^l(z), n>m-l, 

where U^\z), j = 1, . . . ,m, are independent copies of Ue(z) that are inde- 
pendent of V n , £ > 0. From this relation, we obtain (similarly to the above) 
a recurrence for the expected external profile polynomial EU n {z). We have, 
using (2.2), for n > mt + m — 1, 

n-i (A ( n-e-i \ 

(11.3) EU n (z) = mzY J tA(m ~y +> " l - 2) EU e (z). 

(=0 \mt+m—l) 

By [9, Theorem l(i)], we obtain, as above, 

(11.4) EU n {z) ~E{z)n Xl ^- 1 

for some analytic function E{z) with E(z) > for z > 0. Moreover, this 
limit relation is true for all z G D-t , not only for z G D\ , as we will see in a 
moment. Since D\ C -D-t, we can expect that corresponding limit theorems 
hold for a larger range. 

The fact that (11.4) holds for z G D_t needs some explanation. If we just 
use [9], Theorem l(i), this creates the impression that z G -Do is the largest 
region for (11.4) since [9], Theorem l(i) assumes that 5i(Ai(z)) > 0. Further- 
more, the indicial polynomial A(A; z) always has the roots 0, —1, —2, . . . ,—t + 
1, that is, if K(Ai(z)) < 0, then the dominant root of A(A;z) is always 0. 

However, the contribution of a simple root X(z) of A(A; z) to the behav- 
ior of ~EU n (z) is of the form c(z)(— l) n (~ x ( z ^) . This implies that the roots 
0, —1, —2, . . . , — t + 1, provided they are simple, only matter for n <t, that is, 
they do not affect the asymptotics of MU n (z). Hence, if Xi(z), the dominant 
root of F(Q) = z, is different from 0, —1, —2, . . . , — t + 1, then (11.4) is also 
true. Further, by Theorem A.l, (11.4) holds uniformly for any compact set 
contained in D_ t with X\(z) ^ {0, —1, —2, . . . ,— 1 + 1}. 

Nevertheless, a little bit of more careful analysis reveals that the excep- 
tional values 0, —1, —2, . . . , — t + 1 are only present in the analysis, but not in 
the asymptotic result. The limit relation (11.4) extends continuously to all 
z G D-t. For example, if t > 1 and Xi(z) is close to zero, then it follows that 
the power series of EU n (z) can be represented as (compare to the Appendix) 

E EU ^ n = nM*)m*) ( (1 _l )Xl(z) - 1 

n>0 Vl •>/ 

(11.5) 

+ smaller order terms. 
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Further, if Ai(zo) = 0, then A(A; z) has a double zero and we have 

(11.6) E,U n (zo)( n = E(zq) log + smaller order terms. 

n>0 ~~ ^ 

Note that (11.5) and (11.6) are consistent for z — > zq and they imply that we 
also have (11.4) uniformly in a neighborhood of z = Zq. Similar phenomena 
appear if Xi(z) is close to —1, —2, . . . , — t + 1. 

It can also be shown, using (an analytic extension of) the formula in [9], 
Theorem l(i), as for E(z) in Lemma 3.1, that E(z) > if z > is real. 

The expected external profile MY n k is, thus, given by the following asymp- 
totic formula which can be proven exactly as Lemma 8.3. 

Lemma 11.1. Suppose that a±,a2 with < ct\ < ai < oo are given and 
let (5{a) be defined by (3{a)\' l {f3{a)) = a. Then 

EY n , k = 7 i(a "' fcj)n (1 + O(aogn)- 1 / 2 )) 

^2ir(a ntk + (3{a n . k ) 2 \'{{(3{a n . k )))\ogn 

uniformly for a n ^ = k/ log n € \a\ , 0:2] as n, /c — > 00 . 

Hence, we can proceed as above and obtain the following variation of 
Theorem 1.1, with some minor differences in the proofs. [For example, we 
use the fact that E(l + V n ^ r ) a = 0(n a ) for any a > — t — 1; we omit the 
verification of this estimate.] 

Theorem 11.1. Let m>2 and t > be given integers and let {Yn,k)k>o 
be the external profile of the corresponding random search tree with n keys. 

LetJ = {(3>0: Ai(/3 2 ) < 2Ai(/3) - 1} and J' = {f3\[(P) : (3 £ J}. We then 
have 

(11.7) ( I^ alOSni ,a e A ± (YW(a)),a G J'), 

\ n, [a lognj / 

where Y(z) is as in Theorem 1.1. 



The difference between Theorems 1.1 and 11.1 is that Theorem 11.1 is 
true for a larger range for /c/logn since I 1 C J'. The reason is that the 
internal profile X n & has a phase transition at level ao that is not present 
for the external profile (compare with the discussion of critical constants in 
the Introduction). 

Further, note that we can also deal with the process R H: k '■= { m — ^)m k — 
X n ^ that can be approximated by Y(z) in the range a £ (0, ao) H J'. (We 
do not work out the details here.) 
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We close this section with a vector- valued generalization of Theorem 1.1. 
As we know from the introduction, a node in an m-ary search tree stores one 
or several of the keys up to at most m — 1. This means that we can partition 
the nodes into types. We say that a node has type j (j E {1, 2, . . . ,m — 1}) if 
it stores exactly j keys. Further, we can also extend this definition to j = 
if we define nodes of type to be external nodes. 

(i) 

Now, let X^ k denote the number of nodes of type j at level A: in a random 
m-ary search tree with n keys. We can then prove the following theorem. 



Theorem 11.2. Let m>2 and t > be given integers and let {X^\, . . . , 

xjrt k>0, be the random profile vector of the random search tree with 
n keys. 

Let / = {/?>0:KA 1 (/3 2 )<2Ai(/?)-l} and I' = {(3\i(/3) : € /}. We 
then have 



(11. 



v-(O) X (m-1) 
n,L»lognJ n,LalognJ 

n,LalognJ n,LalognJ 



; a e A 



±>(Y(0(a)),...,Y<J3(a)y,aer). 



We do not work out the details, but the same proof techniques as for 
the proof of Theorem 1.1 also work here, using Remark 7.1. We obtain 
convergence to a vector (Yi(z))™^ 1 satisfying 

<Mz)) t ^ {zVt {z) - X YP{z) + zVt {z) ~ l Y^ {z) + ■■■ + zVW-^Wi, 

where (Y^(z))i, 1 < j < m, are independent copies of (Yj.(z))i, and this 
equation is solved by Yq(z) = ■ ■ ■ = Y m _i(z) = Y(z). It follows that the pro- 

(i) 

files X^' k , 0<f <m-l, are asymptotically proportional. 

Note that (11.8) is a functional limit result. Hence, joint convergence for 
several different arguments a follows. In particular, we obtain limits for 
vectors (X^°) , ...... X^ ^ , ,) with fixed an, ■•■ ,a m -i • 

\ n,L«olognJ' ' n, L« m _i lognj / u ' ' 1 

Note that Theorem 1.1 follows directly from Theorem 11.2, since 
X n , k = XV+2X® +... + (m -l)^- 

APPENDIX A: DETAILED PROOFS OF LEMMAS 8.2 AND 8.4 

In Lemmas 8.2 and 8.4, we used results by Chern, Hwang and Tsai [9], 
where we claimed uniformity in z (in certain compact sets) . This uniformity 
can be verified by a tedious checking of the proofs in [9] (Hwang, personal 
communication), but, for completeness, we give a detailed proof here; see 
also Chern and Hwang [8] for the case m = 2. 
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Fortunately, we are in the situation where the relevant generating func- 
tions can be analytically continued outside the unit disc so that the singu- 
larity analysis of Flajolet and Odlyzko [13] (see also Flajolet and Sedgewick 
[14], Chapter 6) applies. As pointed out in [9], page 197, this simplifies the 
arguments considerably, so we consider only this case. 

We introduce the generating function \Jj(C; z) := J2n>o^^n(z)C n ■ Let A(9; z) 
be the polynomial (in 6) of degree r : = mt + m — 1 defined in (3.4) and let 
$ denote the differential operator (1 — C)g£- Further, let b n (z) :=MW n (z) 
for n <r and b n (z) := m — 1 for n>r, and define the generating function 
g{C,;z) := J2 n >o b n (z)C n - Then, as is shown in [9], (3.3) is equivalent to the 
differential equation (in C, with z fixed) 

(A.l) A(0; z)¥(C; z) = 0(C; z) := (l - (Y^g(C, z). 

Note that g(C; z), for every z, differs from (m — 1)(1 — C)" 1 by a polynomial 
in C of degree less than r. Hence, 

(A.2) <MC; z) := (m - 1)(1 - 0^(1 - C)" 1 = (m - l)r!(l - C)" 1 - 

We study solutions to (A.l) in some generality and state a theorem where 
A and (f> can be rather arbitrary polynomials and analytic functions, respec- 
tively, depending on a parameter z. (The parameter set K can be any set, 
although we only need subsets of the complex plane for the present paper.) 

A A- domain is a domain of the type 

A(R,5) :={z£C:\z\ < R and |arg(z-l)| > vr/2 - 5} 
for some R > 1 and 5 £ (0, vr/2) . 

Theorem A.l. Let r > 1 and for each z € K, let A z (6) be a monic 
polynomial in of degree r, with coefficients that are bounded functions of 
z. Moreover, let 4> Z (C) be an analytic function of C, in the unit disc for every 
z 6 K and let ^(C) be a formal power series that solves the differential 
equation 

(A.3) A,(0)¥*(C) = MC). 

We denote the roots of A z (\) = (counted with multiplicities) by \j(z), 
j = 1, . . . , r, arranged in decreasing order of their real parts: diXi(z) > "SiX2(z) > 

(i) Assume that, for each z S K , <p z (C) extends to an analytic function 
in a fixed A-domain A = A(R, 5) and that for some constants a £ (— oo, oo), 
d > and r\ £ (0, 1/2), uniformly in all z G K and Q £ A, 

1 j — \o(|i — CM log |i - Cll d ), |i - CI < v, 
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and that, again uniformly in z G K , 

(A.5) |^(0) = 0(1), fc = 0,...,r-l. 

E'ac/i ^2(0 i^en converges in the unit disc and extends to an analytic func- 
tion in A such that, uniformly in all z £ K and £ G A, 

v m-/ 0(1) ' |i-CI>»7, 

zlu ~\o(|i- cr^^^iiogii - cn d+r ), 1 1 — ci <v- 

(ii) Assume, further, that e > E\ > and that, for all z € K, 3iA2(z) < 
5RAi(z) — e and a < ^StXi(z) — e. (The first assumption is void if r = 1.) Then, 
for some function c(z), uniformly in all z G K and C G A, 

¥*(C) = c( z )(l - £)-AiM + 0(|1 - £|«-«»iW). 

In particular, this holds for some e\ > if K is a compact topological space, 
the coefficients of A z are continuous functions of z and 3?Ai(z) > 9?A2(z) 
and KAi(z) > a for each z G K . 

PROOF. We have K z {-&) = EILiC^ - X\(z)). The roots Xi(z) are not al- 
ways continuous functions of the coefficients of A z (because of ambiguity in 
labeling the roots), but maxj |Aj(z)| is, and since the coefficients are bounded, 
it follows that sup{|Aj(;z)| : z G K,i = 1, . . . ,r} < 00. We may thus treat the 
factors § — Xi(z) one by one and, by induction, it suffices to prove part (i) 
for the case r = 1, A z ($) = •& — X(z), where X(z) is bounded, provided we 
show that the bounds are also uniform in a £ A for any bounded set A. 

In this case (see [9], Lemma 1), it is easily seen that, for each y, there 
is a unique power series *$> z satisfying (A. 3) with ^ 2 (0) =y. Moreover, the 
solution z is given by the analytic function 

(A.6) f,(C) = tt*(0)(l - C)- A(z) + (1 - C)~ A(2) /V - w^-'Uw) dw. 





Clearly, (A.6) defines ^ z as an analytic function in A because <ft z is such a 
function. 

For (i), it only remains to estimate this solution for £ G A, with a = 0(1) 
and A = X(z) = 0(1). We first note that the value of 77 is immaterial, so we 
may, for convenience, assume that rj < R — 1. For |1 — CI > ^ we can choose 
an integration path in (A.6) of bounded length and contained in the region 
|1 - w\ > rj, and it follows that ^ z (() = 0(1). 

Now, assume that C £ A and |1 — CI Let 7 := arg(l — C) and C' := 
1 — r/e 17 . Then 

r \l-w) x ~ 1 (p z (w)dw= / C (l-w) x ~ 1 <f> z (w)dw+ / C (1-^) A ~V 2 (^)^ 



Jo J(' 
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< 1+ r x* X-1 2r a |log2:| d dx 

J\i-c\ 



< | log 1 1 — CI 1^ t x- {a ~ ux) - 1 dx. 
J\l-C\ 



li-CI 

For any real [3 ^ and < y < 1, by the mean value property for 1 1— > 

e tlog(l/iO 

1 _ 1 _ ^10g(l/») _ 1 

y x dx= ^r = — — 

< | logyle^ ) 10 ^ 1 ^) = | logy|y-^ v °). 
This is evidently also true for (3 = and thus 

JO 

Consequently, (A. 6) yields, recalling |1 — C| < rj < 1/2, 

i^(oi < (i + 1 log ii - cn d+1 ii - cr«— KA > v °))i(i - cr A i 
<ii og |i-cii d+1 ii-cr (QVSftA) . 

For part (ii), we factorize A z (<&) = A*(#)(# - \i{z)) and let **(C) = (■&- 
AiW)^(C). Thus, A|(i?)**(C) = &(C). By (i) applied to A* (or directly if 
r = 1), for |1 — CI ^ 7 ?) 

= oni - cr (av3?Aa(z)) i iog|i - cn d+r_1 ) 

(A.7) 

= o(ii-cr !RAi(z)+ei ). 

We now use (A. 6), with X(z) = Xi(z) and <p z replaced by ^>* z . By (A.7), 
the integral c\{z) := Jq(1 — w) Al ^- l ~ 1 ^*(w) dw converges and if we define 
c(z) := ^z(O) +c\(z), then, again using (A.7) we have 

^(C)-c(z)(l-C)- Al(z) 

= -(1 - C)" Al(z) / (1 - «;) Al W -1 *;^) d«> 



c 

, |i — CI 

< h _ £|-&Ai(*) / a .5RAi(z)-l-5RAi(2)+£i dx 



< |]_ _ £|-5RAi(z)+ei 

provided |1 — CI < V- The case |1 — C| > ?7 follows immediately from ^(C) = 
O(l). 

The final statement follows because, under these assumptions, ^tXi(z) and 
9?A2(z) are continuous functions of z E K, so the compactness of K yields 
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the existence of an e > such that $t\%(z) — KA2(z) > e and %l\i(z) — a > e 
for all zeK. □ 

Proof of Lemma 8.2. We use Theorem A.l with A z (£) = A((;z), 
**(C) = and <t> z (Q =(/>((;z), as above. [By (A.2), cb z (() is actually 

independent of z.) We see from (A.2) that (A. 4) holds with a = 1 and d = 0. 
Hence, Theorem A.l(i) yields, for z £ K, £ G A, 



and Lemma 8.2(h) follows by standard singularity analysis (see, e.g., [14], 
Chapter 6). 

Similarly, Lemma 8.2(i) follows easily from Theorem A. 1(h). The constant 
E(z) is necessarily the same as in Lemma 3.1. □ 

Proof of Lemma 8.4. This time, we use (8.12). Let a n {z) :=E|W n (z)| 2 
and let b n {z) be the O term in (8.12). (8.12) then implies that the analogue 
of (A.l) holds for the generating functions of a n and b n , with A(£, z) replaced 
by A(£, \z\ 2 ). However, it is not clear that these generating functions extend 
to a A-domain. Therefore, we instead take g z (C) := 0MC), where 



and C is a constant chosen sufficiently large that |&n(-z)| < [C n ]<?(0 for an z G 
K and n > 0. This is possible because [( n ]h(() = e(n max ^ 25R ( Al ( z ))- 2 ' >(logn) 2D ). 
We then let A z (() := A(C, \z\ 2 ) and dehne ^ 2 (C) to be the solution of (A. 3) 
with initial conditions [C n ] x I / z(C) = ^1 Wn^)) 2 for n <r. It now follows from 
(8.12) and induction that E|iy n (z)| 2 < [C n ]<I^(C) for all n, so it suffices to 
estimate [C n ]^z(0- 

Clearly, (A. 4) is satisfied with a = max{25R(Ai(z)) — 1, 1} and d = 2D. 
Theorem A.l(i) thus yields, uniformly for z G K, 




0(1), 

o(ii-ci- (!RAi(2)vi) iiog|i-cin 



1-C|>»7, 

i-C\<v, 



MO :=(1-C) 



max{2SR(Ai(2))-l,l}^_-^ g^ _ Q iq 



2D 




0(1), 

o(|i-CI 




Standard singularity analysis then yields 



[C]^ z (0 = O(n 



max 



{A 1 (| 2: | 2 )-l,2SR(A 1 (2))-2,0}( logri )2D+r~ ) 



and (8.10) follows. □ 
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